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ABSTRACT 

The isolation and characterization of a human functional GRP78 gene and a processed pseudogene are de- 
scribed. We present the complete primary structure of the human GRP78 gene, which spans over 5 kb and 
consists of eight exons. Sequence comparisons reveal that the GRP78 gene shares unusual homology among 
the human, rat, and hamster in the protein-coding and 3' untranslated regions. In addition, short domains 
highly conserved with HSP70 isolated from human, Drosophila, Xenopus, yeast, and E. coli DNA are iden- 
tified within the hydrophobic regions of GRP78. The intronless pseudogene resembles that of a processed 
gene. It is flanked by a short direct repeat and is embedded within an AT-rich genomic region. The highly 
active promoter from the functional human GRP78 gene contains a TATA box, five CCAAT sequences, 
and two potential binding sites for the transcriptional factor Spl. It consists of a distal domain that enhances 
basal level expression and a proximal domain essential for responses to calcium ionophore and for a tem- 
perature-sensitive mutation which induce the GRP78 gene. Both domains are highly conserved between the 
rat and the human GRP78 promoters. 



INTRODUCTION 

THE GLUCOSE- REGULATED PROTEINS (GRPs) Were first 
observed in fibroblasts following transformation with 
avian sarcoma viruses. Subsequently, it was found that the 
increased synthesis of these proteins was primarily due to 
the depletion of glucose and a variety of physiological 
stress conditions can also induce the synthesis of the 
GRPs. Potent inducers of the GRPs include calcium iono- 
phores, sulfhydryl-reducing reagents, and reagents which 
block protein glycosylation (for review, see Lee, 1987). 
" The most abundant GRP is a 78,000-dalton protein lo- 
calized in the endoplasmic reticulum (ER) (Zala et aL, 
1980). It shares partial amino acid sequence homology 
with the 70,000-dalton heat shock protein (HSP70) and 
was recently identified as the immunoglobulin (Ig) heavy- 
chain binding protein in B lymphocytes (Munro and Pel- 
ham, 1986). Evidence has since accumulated showing that 
GRP78 is ubiquitous and can bind to a variety of proteins 



processed through the ER (Gething et al. y 1986; Hender- 
shot, Bole, and Kearney, in prep.; Nakaki, Deans, and 
Lee, in prep.). Therefore, one plausible function of 
GRP78 is that under normal conditions, it may be part of 
a protein-processing machinery in the ER (Bole et aL, 
1986); during stress, it may bind to aberrant or undergly- 
cosylated proteins and prevent their aggregation (Pelham, 
1986). 

To understand the structure and regulation of the GRP 
genes, the cDNA clones encoding GRP78 and GRP94 have 
been isolated from the hamster (Lee et ai, 1981). These 
molecular probes were used to establish that the genes en- 
coding the GRP78 and GRP94 were regulated coordinately 
at the transcriptional level during glucose starvation, cal- 
cium ionophore treatment, and in a temperature-sensitive 
(ts) hamster cell line K12 which is blocked in protein gly- 
cosylation at 40°C (Lee et a!., 1983, 1986; Lin and Lee, 
1984; Resendez et ai y 1985). The cloning and function of 
the rat GRP78 promoter have also been described (Atte- 
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nello and Lee, 1984; Chang et aL, 1987). The rat GRP78 
promoter contains a highly active enhancer and is capable 
of conferring glucose starvation and calcium ionophore 
A23187 induction upon heterologous genes (Attenello and 
Lee, 1984; Resendez et aL, 1985; Lin et aL , 1986). Further, 
transfection of these fusion genes into the K12 ts mutant 
cell line results in their induction at 40°C (Attenello and 
Lee, 1984; Chang et aL, 1987). These combined results 
strongly indicate the existence of cis-acting elements in the 
GRP78 promoter responsive to specific stress conditions. 

A powerful approach to identify important functional 
domains in the GRP78 promoter and coding sequences is 
through direct comparisons of the GRP78 and other re- 
lated genes derived from different species and searching 
for evolutionary conserved regions. Besides GRP78, the 
gene encoding hsc73, the major HSP70 cognate protein in 
unstressed rat cells has been isolated from the rat genome 
(Sorger and Pelham, 1987). The isolation of the human 
heat-inducible HSP70 gene has been described (Wu et aL , 
1985). In this report, we describe the isolation and determi- 
nation of the complete primary structure of the human 
GRP78 gene. Our analysis reveals the existence of a func- 
tional gene and a processed pseudogene of GRP78 in the 
human genome. The functional gene contains eight exons 
and shares unusual sequence homology with the rat and 
hamster GRP78 gene. In addition, short domains highly 
conserved with HSP70 isolated from human, Drosophila, 
Xenopus, yeast and £*. coli DNA are identified within the 
hydrophobic regions of GRP78. The pseudogene resembles 
that of a processed gene and is embedded within an AT- 
rich genomic region. The promoter from the functional 
GRP78 gene contains a TATA box and five CCAAT se- 
quences. It consists of a distal domain which enhances 
basal level expression and a proximal domain essential for 
responses to various stimuli. Both domains are highly con- 
served between the rat and the human GRP78 promoter. 



MATERIALS AND METHODS 

Cell lines and culture conditions 

The temperature sensitive (ts) Chinese hamster fibro- 
blast cell line K12 has been described (Roscoe et aL, 1973; 
Lee, 1981). It was routinely maintained in DMEM (4.5 
mg/ml glucose) supplemented with 10% cadet calf serum. 
The HeLa D98 AH2 monolayer cells were obtained from 
R.E.K. Fournier (USC) and were maintained in DMEM 
with 10% fetal calf serum. 



Treatment conditions 

HeLa cells grown in 150-mm diameter dishes to 90% 
confiuency were changed to fresh medium. The cells were 
either treated with 7 \M A23187 for 5 hr (Resendez et aL, 
1985) or grown in glucose-depleted DMEM for 16 hr (Lin 
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Isolation of human GRP78 genes 

The human fetal liver genomic library (Lawn et aL, 
1978) was screened using a full-length cDNA plasmid p3C5 
which encodes the hamster GRP78 (Lee et aL, 1981). Fil- 
ters containing about 10 6 recombinant phage were prehy- 
bridized in 50% formamide, 5x SSC (1 x SSC: 0.15 M 
NaCl, 15 mM sodium citrate), 5x Denhardt (1 x Den- 
hardt: 0.02% each of bovine serum albumin, polyvinylpyr- 
rolidone, and Ficoll), 0.1% NaDodS0 4 , 50 mM sodium 
phosphate buffer pH 6.8, 1% glycine, and 100 /tg/ml de- 
natured salmon sperm DNA at 42°C for 1 hr. They were 
then hybridized in the same buffer with nick-translated 
p3C5 probe (sp. act. 7 x 10 7 cpm/^g; total cpm used 1.5 
x 10 8 ) overnight at 42°C. After hybridization, the filters 
were washed three times at 50° C for 60 min each in 5x 
Denhardt, 3x SSC, 0.1% NaDodS0 4 , and 0.1% sodium 
pyrophosphate; and 2 times at 50°C for 60 min each in 1 x 
SSC, 0.1% NaDodS0 4 , and 0.1% sodium pyrophosphate. 
Six recombinant phage hybridized positively with the p3C5 
probe. 



Si nuclease protection 

Cytoplasmic RNA was isolated from control and in- 
duced HeLa cells as described previously (Resendez et aL , 
1985). The DNA probe used for the Si protection assay 
was a 497-nucleotide Sma I fragment corresponding to nu- 
cleotides -368 to 129 (Figs. 1 and 4). The DNA was 5'-end- 
labeled by T4 polynucleotide kinase (BRL) to a specific ac- 
tivity 6 x 10 6 cpm/^g of DNA. About 4 x 10 s cpm of the 
heat-denatured DNA was hybridized with 30 fig of HeLa 
RNA in 80% formamide, 0.4 M NaCl, 40 mM PIPES 
[piperazine-N,N'-bis(2-ethanesulfonic acid)] pH 6.4, and 1 
mA/EDTA at 58°C for 16 hr. The DNA-RNA hybrid was 
then digested with 20 units of S! nuclease (PL Biochemi- 
cals) in 200 pi of reaction mixture containing 0.28 M NaCl, 
50 mM sodium acetate pH 4.6, 4.5 mM ZnS0 4 , and 50 fig/ 
ml of denatured salmon sperm DNA at 37°C for 30 min. 
Undigested DNA was precipitated with alcohol and elec- 
trophoresed on a 6% polyacrylamide sequencing gel. 
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Primer extension 

The primer used was a synthetic oligonucleotide (a 17- 
mer, with sequence 5-GGCCGCGACGCTTACCT-3) 
corresponding to anti-sense nucleotides 24-40 of the rat 
GRP78 5' UTR (Chang et aL, 1987). The primer was la- 
beled by T4 polynucleotide kinase to a specific activity of 
1.5 x 10 8 cpm//ig DNA. About 10 6 cpm of the primer was 
hybridized with 50 fig of HeLa RNA. The hybridization 
and extension reactions were performed as described (Lin 
et aL , 1986), with the exception that hybridization was per- 
formed at 25°C for 5 hr in the same hybridization buffer 
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FIG. 1. Restriction map and exon structure of the GRP78 gene from clone hu28-l . The lower line shows the restriction 
map of the 24-kb human genomic DNA within the phage. The upper line shows the expanded exon structure of the 
GRP78 gene, with the 5' and 3' untranslated regions as open boxes, and the coding region in black, numbered boxes. The 
TATA box and the poly(A) addition sequence AATAAA are indicated. Restriction enzyme sites marked are: Bam HI(B), 
Bgl II (Bg), Eco RI (E), Hind III (H), Nru I (N), Pst I (P), Pvu II (Pv), Sal I(S), Sma I (Sm), and Sph I (Sp). 

) 



Construction of hybrid genes 

To construct plasmids phu78CAT (-1650), phu78CAT 
(-368), and phu78CAT (-170), DNA fragments Sph I- 
Nru I (-1650 to 53), Sma l-Sma I (-368 to 129), or Pvu 
II-Nrw I (-170 to 53) containing the human GRP78 pro- 
moter and 5' UTR sequences (Fig. 1) were blunt-ended 
with T4 DNA polymerase and ligated into the unique Hind 
III site (blunt-ended with T4 DNA polymerase) of the 
pSVOCAT vector (Resendez et al. y 1985). Plasmids con- 
taining the GRP78 promoter and CAT gene fused in the 
same transcriptional orientation were identified by restric- 
tion mapping. 



Transient transfection conditions 
and CAT assay 

Transfection of DNA into K12 cells and the assays for 
the chloramphenicol acetyltransferase (CAT) activity have 
been described (Resendez et aL, 1985). 



DNA sequence analysis 

The DNA contained within the phage insert was restric- 
tion-mapped. Purified DNA fragments were subcloned 
into M13 vector and sequenced by the dideoxy chain-ter- 
mination method (Sanger et at., 1977). Most of the DNA 
fragments were sequenced on both strands. 



Computer programs 

Analysis of the DNA sequences was aided by the Intelli- 
genetics Bionet program. The protein hydropathicity was 
analyzed by using the University of Wisconsin Genetics 
Computer Group (UWGGG) program. 



RESULTS 

Isolation of two types of human GRP78 genes 

A human genomic library was screened with a full- 
length cDNA probe encoding the hamster GRP78 tran- 
scriptional unit. Six positive plaques were isolated; they 
represented five different X clones based on limited restric- 
tion mapping. The clones can be further grouped into two 
restriction patterns, indicating that two different GRP78 
genes derived from different regions of the human genome 
have been isolated. The recombinant phage, hu28-l and 
hu3-2B, representing the two types, were selected for de- 
tailed restriction mapping and probes derived from the 
hamster cDNA were used to identify the limits of the cod- 
ing sequences in the X insert (Figs. 1 and 2). The hu28-l 
clone contains a GRP78 gene spanning 5 kb interrupted by 
intervening sequences, whereas hu3-2B contains a contigu- 
ous coding region within 2 kb. 



The structure of the functional GRP78 gene 

To determine if the GRP78 gene contained within 
hu28-l is functional, we searched for canonical promoter 
and transcriptional initiation and termination elements at 
the 5' and 3' borders of this gene. The transcriptional start 
site was mapped by both S, nuclease protection assay and 
primer extension (Fig. 3A). For this purpose, total cyto- 
plasmic RNA was isolated from HeLa cells induced by cal- 
cium ionophore A23187 or glucose starvation. As a con- 
trol, RNA was also extracted from noninduced cells. In the 
Sj assay, RNA samples were hybridized with a 497-nucleo- 
tide Sma I fragment containing the 5' border of the GRP78 
gene. As shown in Fig. 3B, the major S x -resistant band has 
a size of 129 nucleotides. Using the primer extension 
method, a discrete band of 80 nucleotides corresponding 
to the site mapped by Si protection was detected in the in- 
duced RNA sample (Fig. 3C). The smaller inducible prod- 
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CAACACACTTCTAAATAACATATAGATCTAAAAAAACCATGACAGAAATTATAAATATTTTGAACCAAATGAAAACTAAACCACAACACATCAAAATGTG 
TGAAGTGTAGCTAAATCAATGCTTAGAAATTTATAGCATTAAATGTTTATATTAGAAAATCATAAAATTCTGA 3' 

FIG 2 The processed pseudogene from clone hu 3-2B and the sequence surrounding the insertion site. A. The lower 
line shows the restriction map of the 15-kb human genomic DNA insert within the phage. The upper line shows the ex- 
panded region spanning the 4.4-kb Pst I fragment containing GRP78 coding sequences (black box) and untranslated re- 
gions (open boxes). The restriction sites marked are A va I (A), Apa I (Ap), Bam HI (B), Bgl II (Bg), Bst EII (Bs), Clc r I 
(C) Eco RI (E) Hae II (Ha), Nae I (Na), Nco I (Nc), Nde I (Nd), Pvu II (Pv), Pst I (P), Sac I (Sa), Xba I (X). B. The 
nucleotide sequences immediately surrounding the pseudogene. The 13-nucleotide direct repeat sequence flanking the in- 
serted gene is marked by arrows. 



uct (63 nucleotides) is probably due to premature stop of 
the reverse transcriptase. These combined results identified 
the transcriptional initiation site as 129 nucleotides up- 
stream from the 3 Sma I site. Sequencing the DNA 5' of 
the cap site revealed a TATA sequence 25 nucleotides up- 
stream. In addition, there are five CCAAT sequences lo- 
cated within 250 nucleotides upstream from the RNA initi- 
ation site. Two potential Spl binding sites were localized at 
-228 and -164. At the 3' end of the gene, a polyadenyla- 
tion sequence AATAAA was also found. All of these fea- 
tures resemble that of a functional gene. 

To understand in detail the organization of the GRP78 
gene, the complete primary structure of the human GRP78 
gene contained in the hu28-l clone was determined by 
DNA sequencing. This sequence data, combined with the 
previously obtained full-length cDNA sequence of the rat 
and hamster GRP78 (Munro and Pelham, 1986; Ting et 
aiy 1987), enabled us to delineate the locations of exon- 
intron junctions. As shown in Figs. 1 and 4, clone hu28-l 
contains a gene comprising the entire GRP78 coding se- 
quence interrupted by seven intervening sequences. All the 
exon-intron junctions followed the GT-AG splicing rule 



3?. 



ft 



3,i 



GRP78 gene (S.K. Wooden, R.P. Kapur, and A.S. Lee, in 
prep.), the number and location of the exons are identical. 
However, the sizes of the intervening sequences are similar 
but not identical between the rat and human. 

GRP78 protein contains highly 
conserved domains ( 

The complete amino acid sequence was deduced from 
the determined nucleotide sequence and was subjected to 
the hydropathicity analysis (Kyte and Doolittle, 1982). As 
shown in Fig. 5, this protein is very hydrophilic, especially 
toward the carboxyl half of the protein. Only a few small 
regions were identified to be hydrophobic. Upon compari- 
son of the entire amino acid sequences derived from the 
human GRP78, rat GRP78, and HSP70 from human, 
-Drosophila, Xenopus, yeast and E. coli dnaK gene, a few 
regions (A-H) of high sequence homology (over 80%) can 
be detected (Fig. 5). Similar conservations were observed 
with the rat hsc73. Strikingly, these domains cover most of 
the hydrophobic regions of GRP78, including the previ- 
ously reported 11-amino-acid domain (A) near the amino 
terminus (Chappell et aL, 1986; Chang et al., mi). The^|| 
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FIG. 3. Determination of the transcriptional start site. 
A. Schematic representation of the probes used for Si 
mapping and primer extension. The asterisks indicate the 
kinase-labeled termini. B. S, mapping of GRP78 tran- 
scripts in HeLa cells. RNA samples were from yeast tRNA 
(lane 1), cells grown continuously in DMEM (lane 2), cells 
grown in glucose-depleted DMEM for 16 hr (lane 3), and 
cells grown in DMEM with 7 pM A23187 for 5 hr (lane 4). 
The lane marked M refers to a size marker from a dideoxy 
sequencing reaction run in parallel on the same gel. 
C. Primer extension using a synthetic primer. The RNA 
samples isolated from HeLa cells grown in the absence 
(lane 1) or presence of A23187 (lane 2) were used. The re- 
sultant DNA fragments were electrophoresed on a 6% 
polyacrylamide sequencing gel with DNA fragment size 
markers. The autoradiograms are shown. The sizes (nu- 
cleotides) of the major S, protected band and the primer 
extension product are indicated. 



other six domains fall into two symmetrical groups of three 
(B,C,D and E,F,G) around the center of the protein. The 
most divergent sequences were found at the highly hydro- 
philic region near the carboxyl third of the protein. Inter- 
estingly, the last four amino acids (Lys-Asp-Glu-Leu) 
which are important for retaining GRP78 in the endoplas- 
mic reticulum (Munro and Pelham, 1987) are conserved 
among the human, rat, and hamster GrP78 and are lost 
from the cytoplasmic HSP70. 



Structure of the processed human GRP78 gene 

In contrast to hu28-l, recombinant phage hu3-2B does 
not contain any introns. The coding region was mapped by 



shown) [ r These analyses, coupled with sequencing of "the 
DNA flanking the coding region, revealed that this recom- 
binant contains features characteristic of a processed gene 
(Fig. 2B). First, the coding region is contiguous through- 
out the gene (Ting, 1988). When compared to the func- 
tional gene, we detected a 1 2-nucleotide deletion close to 
the amino terminus of the mature protein and a 4-nucleo- 
tide insertion within exon VI which results in a shift of the 
reading frame. In addition, point mutations were found 
throughout the coding region. At the 3' end of the gene, 
the consensus sequence for the poly(A) addition site 
AATAAA was present; shortly downstream from that se- 
quence, there was a stretch of adenosine residues. These 
results clearly indicate that this gene was derived from a 
RNA intermediate of the GRP78 gene. 

As in the case of other processed genes (Hollis et aL, 
1982; Wilde et <//., 1982; Karin and Richards, 1982), a 
short direct repeat sequence, which probably represents the 
site of insertion of the processed gene, was located flank- 
ing the inserted sequence. The repeat is an AT-rich 13-mer 
with the sequence GAAAATTAAACAA. The sequence 
immediately surrounding the processed gene is strikingly 
AT rich. The 100-nucleotides upstream from the inserted 
gene are 66% AT and the 200 nucleotides downstream 
from the inserted gene are 15% AT. When a 0.9-kb frag- 
ment containing the 5'-flanking region of this processed 
gene was fused to bacterial CAT gene, we could not detect 
any CAT activity after transient transfection into K12 cells 
(Ting, 1988). 



Expression and regulation of the human 
GRP78 promoter 

In hamster cells, the GRP78 gene is transcriptionally ac- 
tivated by the calcium ionophore A23187 (Resendez et aL> 
1985) and the K12 ts mutation (Lee et a/., 1983). By dele- 
tion and in vivo competition, the important region for 
these responses in the rat GRP78 promoter is mapped 
within 430 nucleotides upstream from TATA box (Lin et 
al % 1986; Chang et ai t 1987). To test if the human GRP78 
gene isolated from the hu28-l clone contains a functional 
promoter inducible by A23187 and the K12 ts mutation, 
human GRP78 promoter/CAT fusion genes were con- 
structed and transfected into K12 cells. As a comparison, 
the CAT fusion gene pi 10 containing the rat GRP78 pro- 
moter (Resendez et al, 1985) was also tested. After 30 hr, 
the cells were treated with 7 fiM A23187 or shifted to the 
nonpermissive temperature 40°C. Cell extracts were as- 
sayed for the CAT activity (Fig. 6A) and the promoter ac- 
tivities of the CRP78/CAT fusion genes under induced or 
non-induced conditions are summarized (Fig. 6B). 

Our results show that the human GRP78 promoter is 
capable of conferring A23187 and K 12 ts mutation induc- 
tion to the CAT gene. Further, no significant changes were 
observed at basal or induced activity when the 5'-flanking 
sequence of the human GRP78 gene was reduced from 
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aGCTCGATTACAGCCTGACCACCACACCTGGCTAATTTCTGTATTTTTAGTAGAGGATGGGCTTTCA 3840 
^AACCK^c™^ 

a 0TC acag T ac™aaa™^ 

1NTRON 7- ->G E R P L T K D N 
AGTGTCACTTTATGCTTCAGACTGAAATG<XKMX!ATCTAGATGTGCTAATGCTTGTCAGTAACAACTA^ *560 

tc 2 tc ^ T aca^g^^ 

O^GA^^ 4800 

ag^agLgI^^ 

aaot^c^c^^ 

aa^aaagaa^aagk™^^^ 
gtggog^agaagaagaa^^^ocgggx^ 

GTnxnXTAOCATAAGTCACACC^AlAMTGTlTGTTArriACACnXiOT 

FIG 4 Primary structure of the human GRP78 gene. The DNA sequence is determined from ^rae h^8-L^e to- 

iSamino-acid leader sequence and produces the amino terminus of the mature protein. 



1650 nucleotides to 368 nucleotides (our unpublished re- 
sults and Fig. 6B). When only 170 nucleotides were in- 
cluded, there was a twofold decline in both basal and in- 
duced activities. However, all three constructs showed 
about the same fold of induction at 40°C and by A23187. 
On the basis of these results, we conclude that the human 
GRP78 promoter contains a stress-inducible region that re- 
sponds to A23187 or K12 ts mutation contained within 170 
nucleotides upstream from transcriptional initiation site 
and a distal region at -368 to -170 which enhances the 
basal level of expression about twofold. 



Promoter sequence comparison between human, 
rat, and chick GRP genes 

Comparison of the rat and human GRP78 promoter se- 
quences revealed a strong homology (78%) within 340 nu- 
cleotides upstream from the RNA initiation site of the hu- 
man GRP78 gene (Fig. 7 A). Several interesting sequences 
found in the rat GRP78 promoter (Lin et al. t 1986) includ- 
ing four CCAAT boxes, and a putative Spl binding site 
are conserved in the human GRP78 promoter. The proxi- 
mal 170 nucleotides found to be important for the indue- 
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tive responses are highly conserved (86%). The distal do- 
main important for high basal level shares about 70% se- 
quence homology. Notably, the rat and human GRP78 
promoter sequences start to diverge outside these critical 
regions. Beyond -340, the sequence homology is less than 

20%. . J . , 

Since the GRP78 and GRP94 genes are activated simul- 
taneously by many stress conditions (Lee, 1987), we 
searched for sequence elements in the GRP78 and GRP94 
promoters for consensus sequences which may account for 
their coordinate regulation. For this purpose, the promoter 
sequence reported for the chick GRP94 protein (Kleinsek 
et al , 1986; Munro and Pelham, 1987) is compared with 
that of the rat and human GRP78 genes. The chick GRP94 
promoter sequence is divergent from the rat and human 
GRP78 promoters except for a region of 22 nucleotides 
upstream from the TATA sequence (Fig. 7B). This highly 
conserved region within the three promoters examined lies 
within the critical region required for A23187 and K12 ts 
mutation and is^ devoid of CCAAT and Spl binding sites. 



DISCUSSION 

We have described the isolation of two genomic clones 
encoding a functional and a processed human GRP78 
gene. The functional gene is separated by intervening se- 
quences into 8 exons while the processed gene is contigu- 
ous. Interestingly, among the six positive recombinant 
phage identified, only two different restriction map pat- 
terns emerged. These results, together with the genomic 
blot analysis which showed only a few bands hybridizing 



shown), suggest that there may only be one functional and 
one full-length processed GRP78 gene in the human 

genome. . 

The functional GRP78 gene, in contrast to intronless 
heat-inducible HSP70 genes isolated so far, contains a 
leader sequence and seven introns. Structurally, this gene 
resembles more the intron-containing, heat shock nomn- 
ducible hsc73 gene reported in the rat (Sorger and Pelham, 
1987). However, GRP78 contains a signal sequence which 
targets the protein into the endoplasmic reticulum and 
extra amino and fewer carboxyl terminal sequences as 
compared with hsc73. Interestingly, sequences homologous 
to part of the 5' untranslated region (UTR) and the extra 
amino-terminal sequences of the rat GPR78 gene are 
found within the first additional intron of the rat hsc73 lo- 
cated in its 5' UTR (S.K. Wooden, R.P. Kapur, and A.S. 
Lee, in prep.). 

From tryptic peptide maps of the hamster and chicken 
GRP78 we predicted previously that it must be highly con- 
served through evolution (Lee et al., 1981). More direct 
evidence came from the comparison of nucleotide se- 
quence between hamster and rat which revealed a 94% ho- 
mology in the coding region (Ting et al., 1987). The 
magnitude of conservation between the primate gene re- 
ported here and the rodent gene reported previously is also 
surprisingly high. Although they diverged from each other 
about 75 million years (my) ago, the percent of homology 
in the coding DNA region is as high as 91% between these 
two species. Based on this conservation, we calculated that 
the rate of silent site mutation within the coding region is 
only 0.1% per my as compared to the 0.5-1% average mu- 
tation rate generally observed (Hayashida and Miyata, 
1983- Yaffe et al, 1985). The higher rate of generating 



.k 




FIG. 6. Promoter activities of human CRP78-CAT fusion gene constructs. A. K12 cells were transfected with 3 ng of 
CAT fusion genes containing various, lengths (368 and 170 nucleotides upstream from the CAP site) of the human 
GRP78 promoter. As a comparison, a CAT construct (pi 10) which contains 1290 nucleotides of the rat GRP78 promoter 
(Resendez et al., | 9 «) >s also included. After 30 hr, the cells were either maintained at 35°C (lane 1), shifted to 40°C 
(lane 2), or treated with 7 nM A231&7 af 35'c for 16 hr (lane 3). Protein extracts from the transfectants were assayed for 
CAT activities. The positions of chloramphenicol (CM) and its acetylated forms (3-Ac and 1-Ac) are indicated. B. 
Summary of the promoter activities promoter features of the GRP78 promoter contained within the CAT fusion 
gene constructs are schematically jwewited in the line diagrams. The symbols marked are (■) TATA, (^ ) CCAAT, 
(t=, ) CCAAT inverted, (A) putativs %r,\ Ending site, and ( r— ) the major RNA initiation site from the GRP78 gene. The 
results represent the average of thrs* ^ rite experiments and are expressed as a percentage of the activity produced by 
the -1650 deletion of the human promoter in the absence of temperature shift or A23187 treatment. 



with the coding region indicate x,x there is se i ec tive 
pressure to conserve the GM1>. > /A , rit ion xhe y 
UTR shares 88% homology b*„t* ; toman * nd rat . The 
conserved region covers the train V 1,7* ^ tn b ond the 
polyadenylation sequence, and tat >^ of divergence 
within the proximalhalf of the V v; V< „ much , ower , han 
in the distal half. Thu observ*^ rt ^ CTceptjon tQ the 
general rule that the rate of suw-.^ ... the dista] por . 
tion is about half the rate in tfe* ^.. xUwd portion of 3. 



UTR (Miyata et al., \W\). test. < f ;* 



samples also re- 



ported strong conservation m tut f; T * reg ion. Genes 
from various mammalian t^eosi ^ for the j3 . actin> 
cardiac muscle actin, the tfycoprotein hor- 

mone, and the tf«coma» m*** ^ ce homol . 
ogy at the 3' UTR (Yaffe et al., Y A < A v« Beveren et al., 
1983; Van Straaten et al., V/<l h m ^ hiologicai sig . 



nificance of conservation in the 3' UTR is not clear, it 
might regulate gene expression at the translational level 
(Liebhaber and Kan, 1982). 

With the identity of GRP78 as the immunoglobulin 
heavy-chain binding protein (Munro and Pelham, 1986), 
GRP78 represents a stress protein with a demonstrable 
binding function to other proteins. It may do so by bind- 
ing to the hydrophobic regions of proteins improperly 
folded in the ER and prevent their aggregation. Since the 
expression of GRP78 is correlated with the inhibition of 
protein glycosylation (Chang et al., 1987), it is possible 
that under stress conditions, the glycosylation process is 
disrupted resulting in the accumulation of underglycosyl- 
ated proteins in the endoplasmic reticulum. Without the 
sugar moiety some proteins may not fold properly and 
some hydrophobic regions normally embedded in the 
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- 1 6 1 AATCGGCAGCGGCCA - GCTTGGT - 1 40 

- 1 2 4 AA TCGGCGGCGGCCA - GCTTGGT - 1 0 3 

- 2 0 2 AATCG ACGCCGGCCACGCTCCGT - 1 8 0 



FIG 7 Sequence comparison between the human and the rat GRP78 promoter. A. The promoter sequence of rat 
(Chang et al 1987) and human GRP78 gene reported here are aligned to obtain maximal homology. The sequence is 
numbered starting from the major RNA cap site (r-) of the human GRP78 gene as +1. Identical nucleotides are indi- 
cated by vertical dots. The TATA and CCAAT sequences are boxed. The putative Spl binding sites are underlined. The 
deletion end points of the two GRP78-CAT fusion genes tested (-368 and - 170) are indicated (Y). B The consensus 
sequence found in the promoter regions of the human, rat GRP78 gene and the chicken GRP94 gene (Kleinsek et at., 
1986). The sequences are numbered starting from their RNA initiation site. The location of this sequence in the human 
and rat GRP78 promoter is marked by a heavy line in A. 
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hydrophilic environment or shielded by the sugar residues 
may now be exposed. These hydrophobic domains may be 
the binding sites for GRP78. In support of this hypothesis, 
the Ig heavy-chain constant domain C H 1, which binds to 
GRP78, is hydrophobic and normally covered by the light 
chain (Hendershot et al. 9 1987). It is postulated that bind 



tion and the premature secretion of these proteins (Bole et 
al. t 1986). To carry out this proposed function, it is desir- 
able that GRP78 be a soluble protein. It also requires that 
GRP78 contains some hydrophobic domains to interact 
with the exposed hydrophobic regions of the improperly 
folded proteins. From the hydropathicity profile of 
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HSP70 family. Since HSP70 is also postulated to have pro 
tein binding ability (Pelham; 1986; Pinhasi-Kimhi et ah, 
1986) it is possible that these conserved regions are the 
critical protein binding domains within the stress gene pro- 
tein family. In vitro mutation of the conserved domains 
may yield important information about their functional 
significance. 

From the analysis of the rat GRP78 gene promoter, the 
regulatory elements for K12 ts mutation and calcium iono- 
phore A23187 induction are localized within 430 nucleo- 
tides upstream of the TATA sequence (Chang et al. , 1987). 
A 291-nucleotide subfragment from the rat GRP78 pro- 
moter fused to heterologous promoters is capable of re- 
sponding to various stimuli, as well as increasing substan- 
tially the basal level activity of the promoters (Lin et al., 
1986; Y.K. Kim and A.S. Lee, in prep.). This suggests that 
there are at least two distinct regulatory elements in the 
GRP78 promoter. Our deletion studies of the human 
GRP78 promoter further revealed that a 170-nucleotide 
fragment immediately upstream from the RNA initiation 
site contains the DNA sequence requirement for the K12 ts 
mutation and calcium ionophore induction. Further up- 
stream sequences enhance the basal level expression. The 
discovery of the consensus sequence among the otherwise 
divergent GRP78 and GRP94 promoters would allow us to 
test whether this short conserved sequence is functionally 
significant in the in vivo regulation of the GRP genes by 
stress. Detailed analysis of the cw-acting DNA regulatory 
elements in the GRP78 promoter, coupled with the isola- 
tion of the transcriptional regulatory factors, will provide 
the critical information on its regulatory mechanism. 
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The cene encoding GRP78 has been shown to be constitutively expressed in many cell types and is inducible 
bv the calcium ionophore A23187. To understand the regulation of GRP78 transcription, we analyzed the 
components that control its basal-level expression. By transfecting deletions into cells, we have identified a 
54-nucleotide m-acting regulatory element important for high basal-level expression and a contiguous 
50-nucleotide element for both basal-level expression and A23187 induction. Using DNase footprinting assays 
with both rat and human GRP78 promoters, we demonstrated that the protein factors present in the HeLa cell 
nuclear extracts bind to the regulatory regions identified by the deletion studies. This domain contains a 
palindromic sequence and is highly conserved among GRP genes in Caenorhabditis elegans, chicks, rats, and 
humans. 



^ A set of stress-inducible proteins known as the glucose- 
regulated proteins (GRPs) are constitutively expressed in 
mammalian cells. When the cells are deprived of glucose or 
treated with reagents that inhibit protein glycosylate, 
perturb intracellular calcium stores, or denature proteins, 
the synthesis of the GRPs is rapidly increased (11). The 
major GRP in mammalian cells has a molecular mass of 
about 78,000 daltons and is localized within the endoplasmic 
reticulum (ER) (18, 25a). This protein, generally referred to 
as GRP78, shares about 60% sequence homology with the 
70,000-dalton heat-shock protein (HSP70) and is thought to 
have a function analogous to that of HSP70 but within the 
ER (18). By immunoprecipitation and comparisons of pep- 
tide maps and amino-terminal sequences, GRP78 was found 
to be identical to the immunoglobulin-binding protein re- 
ferred to as BiP (1, 4, 15). In lymphoid and fibroblast cells, 
GRP78-BiP has been shown to bind to immature immuno- 
globulin and aberrant proteins (1, 18). Recently, the se- 
quence of the 3,215-dalton steroidogenesis activator poly- 
peptide isolated from a rat cell tumor has been reported (17). 
Strikingly, this 30-residue peptide, which has the ability to 
facilitate cholesterol side-chain cleavage to pregenolone, is 
identical, except for two residues, with the last 30 amino 
acids of the GRP78 carboxyl terminus as predicted from the 
DNA sequence of the rat GRP78 gene (15, 23). 

The gene encoding GRP78 has recently been isolated from 
a human genomic library (22a). Both the rat and the human 
GRP78 genes are split into eight exons and contain a signal 
sequence which targets GRP78 into the ER (22a, 25a). The 
expression of the GRP genes is regulated at the transcrip- 
tional level (9, 10, 13, 19). Thus, when the cells are treated/ 
with potent inducers such as the calcium ionophore A23187, 
P-mercaptoethanol, or tunicamycin, the transcription of the 
GRP78 gene is increased 10- to 25-fold within 5 h (3, 9, 19). 
The GRP78 promoter is highly active, and a 291-nucleotide 
(nt) fragment from the promoter region functions as an 
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enhancer when fused to other cellular promoters (12). This 
region of the GRP78 promoter is G+C rich and contains 
several CCAAT sequences and extensive arrays of tandem 
and inverted repeat motifs characteristic of other cellular 
and viral enhancers (6). In addition, this 291-nt fragment, 
when placed upstream of a heterologous gene, can confer 
inducibility by calcium ionophore (12). 

GRP78 transcript levels in HeLa cells. To facilitate the 
search for cellular factors which interact with the DNA 
regulatory domain, we utilized the HeLa cell system, which 
can be grown in suspension to provide high yields of cellular 
extracts. We have shown previously that GRP78 transcripts 
can be detected in a variety of mammalian cell lines (20). 
Since the focus of this study was on the utilization of the 
human HeLa S3 cell line as a source of functional protein 
extracts, we first investigated the expression of the endoge- 
nous GRP78 gene in these cells. GRP78 mRNA was detect- 
able in noninduced HeLa cells grown in suspension cultures 
(Fig. 1). When the cells were further treated with 0.5 or 2 jxM 
of the calcium ionophore A23187 or were grown continu- 
ously in the culture medium without a medium change, the 
endogenous GRP78 transcript levels were further increased 
(Fig. 1). These results demonstrate that the GRP78 gene is 
constitutively expressed in HeLa cells to provide a basal 
level of GRP78 transcripts. Furthermore, A23187 is capable 
of eliciting a strong stress response in the HeLa cells. 
However, A23187 is relatively toxic to HeLa S3 cells com- 
pared with fibroblast cell lines from hamster, rat and mouse. 
The optimum conditions for the induction of the GRP78 gene 
in HeLa S3 cells are 2 uJVI A23187 and an incubation period 
not longer than 6 h. Higher concentrations of A23187 or 
longer incubation periods or both result in substantial cell 
death (unpublished results). 

Transcriptional activities of rat GRP78 promoter-deletion 
mutants in human cells. It has been shown previously that a 
291-nt Smal-StuI fragment upstream of the TATA element of 
the rat GRP78 promoter is important for both high basal level 
and A23187 induction of the reporter gene after transfection 
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TABLE 1. Relative CAT activities" 



1 



2S 3 



FIG. 1. Enhanced expression of GRP78 mRNA levels in HeLa 
cells. Cytoplasmic RNA was extracted from HeLa cells after 
incubation under the following conditions: control (lane 1), 0.5 \lM 
A12387 (lane 2), 2 jj,M A23187 (lane 3), and medium starvation (lane 
4). For control and calcium ionophore (A23187) treatment, cells 
were placed in fresh medium 8 h before cytoplasmic RNA extrac- 
tion; for calcium ionophore treatment, cells were incubated for 5 h 
with the ionophore before RNA extraction; and for the medium 
starvation condition, the cells were kept in the original medium and 
incubated for an additional 24 h before RNA extraction. The 
cytoplasmic RNA samples (5 p,g) were electrophoresed on a form- 
aldehyde denaturing gel, trans-blotted onto a nitrocellulose filter, 
and hybridized with radiolabeled hamster GRP78 cDNA plasmid 
p3C5 (23a) as described previously (10, 20). The arrow indicates the 
position of the 2.7-kilobase GRP78 transcript. 



into hamster fibroblast K12 cells (2, 12). The sequence of this 
region and the proximal promoter sequences are shown in 
Fig, 2A. To delineate further the functional domains within 
this region, a series of 5'-deletion mutants were constructed 
and their deletion endpoints were determined by DNA 
sequencing. To test the functionality of these deleted 5'- 
fianking sequences, they were fused upstream to the bacte- 
rial chloramphenicol acetyltransferase (CAT) transcriptional 
unit (Fig. 2B). These 5'-deletion plasmids were transfected 
into HeLa, HepG2, and K12 cells. The cells were transfect- 
ed with 3 or 5 |xg of cesium chloride gradient-purified 
plasmid DNA with 5 or 7 u,g of high-molecular- weight carrier 
HeLa DNA. The transfection conditions and the assay of 
CAT activity were performed as described previously (16, 



HepG2 



K12 



Deletion 
endpoint 



Basal 
level 



Level with 
A23187 (increase 
[fold]) 



Basal 
level 



Level with 
A12387 (increase 
[fold]) 
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100 


150(1.5) 


100 


500 (5.0) 


-154 


20 


50 (2.5) 


20 


160 (8.0) 


-130 


3 


4 (1.3) 
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20 (3.5) 


-104 


2 


2 (1.0) 
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3 (1.5) 


-85 


1.5 


1.5 (1.0) 


2 


3 (1.5) 


pSVOCAT 


0.5 


0.5 (1.0) 







a Level of CAT activity of pGRP78(-208)CAT under noninduced condi- 
tions was set as 100. Results represent averages of three sets of transfections 
for each cell line. 

19, 24). The promoter strengths of the deletion mutants were 
determined by the CAT enzymatic activities under nonin- 
duced and A23187-induced conditions (Table 1). In these 
transient transfection experiments, we failed to detect CAT 
activities in the HeLa cells transfected with the CAT plas- 
mids, probably due to the low transfection efficiency of the 
HeLa cells used. However, our results with the human 
HepG2 and hamster K12 cells consistently showed that 
between deletions -208 and -154, a fivefold reduction in the 
basal level of expression was observed, suggesting that this 
54-nt domain contains important c/s-acting regulatory ele- 
ments for high basal-level expression. When the promoter is 
deleted to -130, the basal level was further reduced and the 
2.5- to 8-fold induction by A23187 observed in pGRP78 
(-154)CAT was reduced by one-half. In the deletion mu- 
tants pGRP78(-104)CAT and pGRP78(-85)CAT, only min- 
imal activities were detected. While these activities were 
higher than the promoterless plasmid pSVOCAT, the basal 
level was further reduced and A23187-induced expression 
was eliminated. This implies that the 50-nt domain between 
deletions -154 and -104 contains important DNA elements 
for both basal level and A23187 induction. 

The HeLa extract contains protein factors that bind to the 
rat GRP78 regulatory domain. Using an in vitro transcription 
assay, we observed that the nuclear extracts from HeLa 
cells were able to transcribe the rat GRP78 promoter in a 
template- and extract-dependent manner. In addition, by gel 
mobility shift experiments we detected multiple complexes 
formed between HeLa nuclear extract and the regulatory 
region of the GRP78 promoter defined above (unpublished 
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FIG. 2. Promoter deletions and activities of CAT constructs. (A) The sequence of the rat GRP78 promoter. Bases are numbered in 
reference to the major mRNA cap site of GRP78 (12). Several relevant restriction sites are noted. The deletion mutants were generated by "&£j^ 
BAL 31 digestion of pHO (19). The deletion endpoints for each of the deletion plasmids used for the transfection are indicated. The CCAAT 
and TATA regions are boxed. The region protected from DNase I digestion described below is bracketed. (B) The organization of the deletion j 
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FIG. 3. Footprint analysis of the rat GRP78 promoter. A 159-nt 
EcoRI-Hindlll fragment (spanning -195 to -88) was end labeled, 
cut with Aspll% to obtain the labeled coding strand or with Sail to 
obtain the labeled noncoding strand, and gel purified. The HeLa 
nuclear protein extract was prepared as described previously (21). 
(A) Coding strand footprint. Lanes: 2 and 4, DNA probe incubated 
without protein extract; 3, DNase-treated probe with 44 fxg of HeLa 
nuclear protein extract; 1 and 5, Maxam and Gilbert G sequencing 
reaction (14) of the labeled strand. (B) Noncoding strand footprint. 
Lanes: 2 and 5, DNase-treated probe without protein extract; 3 and 
4, 44 and 66 u.g, respectively, of HeLa extracts; 1 and 6, Maxam and 
Gilbert G sequencing reactions. Diagrams of the coding and non- 
coding regions are shown next to the gels. The footprint regions A 
and B are boxed. The numbers denote nucleotides correspond to the 
locations of the footprints relative to the transcription start site ( + 1) 
of the rat GRP78 gene. 

results). To define the binding sites of these factors within 
the rat GRP78 promoter, the subfragment spanning —195 to 
-88 was end labeled, mixed with HeLa nuclear extracts, and 
subjected to DNase I digestion. Binding reaction mixtures 
(20 jjuI) contained 10 mM Tris-hydrochloride (pH 7.5), 50 mM 
NaCl, 10 mM (3-mercaptoethanol, 1 mM EDTA, 4% glyc- 
erol, 2 mM MgCl 2 , 1 u.g poly(dl-dC), and 0.4 to 1 ng of 
end-labeled DNA. After 20 min of incubation at room 
temperature, 20 uJ of 5 mM CaCl 2 -l mM EDTA was added; 
freshly prepared DNase I was added, and the mixture was 
incubated for 60 s at room temperature. To terminate the 
nuclease reaction, 50 u.1 of buffer (200 mM NaCl, 20 mM 
EDTA, 1% sodium dodecyl sulfate, 250 |xg of tRNA per ml) 
was added. Samples were deproteinized by phenol-chloro- 
form extraction, and the DNA was ethanol precipitated, 
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FIG. 4. Footprint analysis of the human GRP78 promoter. 
pUCH223 (-170, +53), which contains the 223-nt PvuII-NruI 
fragment from the human GRP78 promoter (22a) subcloned into the 
Smal site of pUC8 (with the PvuW site proximal to the EcoK\ site of 
the pUC8 polylinker sequence), was linearized with Hindlll, end 
labeled with T4 DNA polymerase, and cut with EcoRl to obtain the 
labeled coding strand. The labeled noncoding strand was obtained 
by labeling the EcoRI-Hindlll insert of pUCH223 and recutting with 
Sail. (A) Coding strand footprint. Lanes: 1 and 4, DNase-treated 
probe without nuclear extract; 2 and 3, DNase-treated probe with 44 
and 66 u.g, respectively, of HeLa nuclear protein. (B) Noncoding 
strand footprint. Lanes: 2 and 6, DNase-treated probe without 
nuclear extract; 3, 4, and 5, DNase-treated probe with 22, 44, and 66 
M-g, respectively, of HeLa nuclear protein; 1 and 7, Maxam and 
Gilbert G sequencing reaction of the labeled strand. Diagrams of the 
coding and noncoding regions are shown next to the gels. The boxes 
represent the footprint regions (A' and B'). The numbers denote 
nucleotides and correspond to the locations of the footprints relative 
to the transcription start site (+1) of the rat GRP78 gene. 
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protected from DNase I digestion. These domains reside 
within the functional domains (-104 to -208) defined by in 
vivo transfection experiments described above. 

Binding of protein factors to the human GRP78 promoter. 
We have previously shown that the rat and the human 
GRP78 promoter sequences are highly conserved (22a). The 
170 nt proximal to the TATA sequence in the human GRP78 
promoter have been identified as important for basal-level 
expression and inducibility by A23187 (22a). Sequences 
further upstream enhance the basal-level expression by 
twofold. Therefore, it is of interest to determine whether the 
human GRP78 promoter interacts with protein factors from 
HeLa cells in a similar pattern as the rat promoter. For this 
purpose, a human GRP78 promoter subfragment spanning 
from -170 to +53 of the promoter was end labeled at either 
its coding or noncoding strand, mixed with the HeLa ex- 
tracts, and subjected to DNase I digestion as described 
above. The results indicate that regions spanning residues 
-101 to -160 are protected in the coding strand (Fig. 4, A' 
box) and residues -103 to -155 are protected in the non- 
coding strand (Fig. 4, B' box). The A' and B' sequences from 
the human GRP78 promoter are similar to the A and B 
sequences of the rat GRP78 promoter. Another region more 
proximal to the TATA element also exhibited some protec- 
tion in both strands at high DNase I concentration. This 
region was not detected in the rat GRP 78 promoter shown in 
Fig. 3, since the rat DNA fragment used for the footprint 
analysis was shorter than the human fragment. In other 
experiments in which a longer rat promoter fragment was 
used, the same region was protected (T. Nakaki and A. S. 
Lee, unpublished results). 

Comparison of rat and human GRP78 promoter regions 
that bind to cellular factors. The sequences of the rat and 



human GRP78 promoters binding to proteins are summa- 
rized in Fig. 5. Generally, the same region protected by 
DNase I digestion in the rat promoter is also protected in the 
human promoter. This is particularly evident in the case of 
the noncoding strand, where one end point of protection is 
identical and the other end is similar. The human promoter 
exhibits a footprint on its coding strand which directly 
overlaps that of its noncoding strand. However, the footprint 
(Fig. 4, A box) of the coding strand of the rat promoter is 
longer than the noncoding footprint (Fig. 4, B Box) by about 
10 to 20 nt. These subtle differences may be related to the 
DNA sequence divergence of the two promoters. However, 
the sequences at the protection boundaries are identical for 
the coding strands of the two promoters. Most importantly, 
the protected domains coincide with the region shown to be 
critical for in vivo expression of the GRP78 gene. Within this 
region, a palindromic sequence and three pairs of short 
direct repeats are present. 

The GRP78 regulatory domain is highly conserved. The 
above in vivo deletion and the footprint protection results 
both point to a small domain within the rat GRP78 promoter 
that interacts with proteins from HeLa nuclear extracts. The 
availability of sequence data from several GRP genes al- 
lowed us to compare their sequences for possible common 
consensus (Fig. 5). While the rat and human GRP78 genes 
are highly conserved over a 200-nt region of the two promot- 
ers (22a), the GRP78 promoter sequence from Caenorhab- 
ditis elegans is very divergent from the mammalian sequence 
(M. Heschl and D. Baillie, submitted for publication). How- 
ever, there is one region within the C. elegans promoter 
which has 19 out of 23 nt matched with the rat sequence. As 
shown in Fig. 5, this region resides within the critical domain 
for high basal-level expression defined by our analysis. On 
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FIG. 5. Footprint sequence of the rat and human DNA and its conservation. The sequences that footprint on the coding strand of rat (Af; 
and human (A') GRP78 gene and on the noncoding strand of rat (B) and human (B') GRP78 gene are bracketed. The palindromic sequence^ 
is indicated by a pair of hatched arrows, with the center of symmetry highlighted by a black dot. Other pairs of direct repeats are indicated 
by the other types of arrows. The homologous sequences between the rat and human promoter at the coding strand footprint boundaries are V 
boxed. The highly conserved domain of the rat GRP78 promoter with the 5'-flanking regions of GRP genes of the other species is also shown!*' 
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by ourfootprint analysis: The strict conservation of these 
short domains within such phylogenetically diverse species 
as worms, chicks, rats, and humans strongly implies that 
they have a functional significance in the regulation of GRP 
gene expression. 

The GRP78 gene system represents a unique model for the 
study of the regulation of genes encoding ER-localized 
proteins. Most interestingly, GRP78 transcriptional activity 
appears to correlate directly with the amount of ER activity 
or ER damage in a variety of cell types, bringing up the 
question of how the state of another organelle, such as the 
ER, can regulate a gene in the nucleus. Clearly, intermediate 
molecules must exist and traverse the membranes to com- 
municate the signal to the GRP gene system. Since the ER 
membranes are associated with the perinuclear membranes, 
traffic between the two organelles can be envisioned. 

By 5 '-deletion analysis, we show here that a 100-nt se- 
quence within this region is critical for both high basal and 
A23187-induced expression. This same DNA segment is 
highly conserved among the GRP genes and is protected by 
cellular factors in DNase protection assays. If the footprint- 
ed domain represents the core for the binding activities of 
cellular factors that regulate GRP78 gene expression, this 
region may represent binding sites for a novel class of 
transcriptional factors that have not yet been described. By 
sequence analysis, it is devoid of recognition sites for 
characterized transcriptional factors such as Spl, CTF, API, 
AP2, and TFIID (5). Since this region contains a pair of 
inverted repeats which basically results in the same se- 
quence independent of orientation, it may explain the orien- 
tation-independent characteristics of the DNA enhancer 
element contained within the GRP78 promoter. 

Although GRP78 shares partial sequence homology with 
members of the 70,000-dalton heat shock protein family and 
may share the properties of binding to abnormal proteins 
under stressful conditions, the regulation of GRP78 expres- 
sion is distinct from that of HSP70 in many ways. First, the 
transcriptional activation of the GRP78 gene is sensitive to 
the protein synthesis inhibitor cycloheximide (8, 20), 
whereas the induction of the heat-inducible HSP70 gene 
does not require de novo protein synthesis. In fact, it has 
been demonstrated that the heat shock regulatory factor can 
bind to the heat shock element in the presence of cyclohex- 
imide (26). In addition, the most potent inducers of GRP78 
expression, such as calcium ionophores and p-mercapto- 
ethanol, do not affect HSP70 expression (7, 9, 25). There- 
fore, whatever is the mechanism of induction of GRP78, it is 
likely to be different from that of the HSP70 gene and 
probably involves multiple steps (7, 11, 25). The identifica- 
tion of the rra/u-acting factors which regulate the expression 
of the GRP78 gene holds the key to uncovering the interme- 
diate steps whereby a signal from the ER can be transmitted 
to the nucleus to enhance GRP78 transcription. 
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GRP78, also known as BiP, is one of the better-characterized molecular chaperones. It has been implicated 
in protein folding and also calcium sequestration in the endoplasmic reticulum. When the cells are subjected 
to endoplasmic reticulum stress, in particular the depletion of stored calcium and/or the accumulation of 
abnormal proteins, the rate of transcription of grp78 is enhanced. Previous studies have shown that the core 
region of the rat grp78 promoter (-170 to -135), which is 95% conserved with the human grp78 core (-133 
to -98), is one of the key regulatory elements. Using ligation-mediated PCR, we have found that there are 
specific changes in factor occupancy after stress induction and the major changes occur within a cluster of 
bases located in the 3' half of the grp core, whereas other regulatory elements are constitutively occupied. This 
inducible binding to the 3' half of the human grp78 core region is observed under diverse stress signals, 
suggesting a common mechanism for the grp stress response. Nonetheless, the lack of constitutive in vivo 
protection at this region is not due to the absence of a binding factor in nuclear extracts. Using in vitro gel 
mobility shift assays, we detected a constitutive binding activity which exhibits specificity and affinity to the 
stress-inducible region. Through sodium dodecyl sulfate-polyacrylamide gel electrophoresis size fractionation 
and renaturation analysis, the activity is found in polypeptides with molecular sizes of 65 to 75 kDa. After a 
three-step purification scheme including core affinity column chromatography, we purified p70CORE, which is 
about 70 kDa in its monomeric form. The purified p70CORE is sufficient to form a complex specific to the 
stress-inducible region. 



ji The 78-kDa glucose-regulated protein (GRP78), also known 
Is the immunoglobulin heavy-chain-binding protein (BiP), is a 
Jpajor cellular protein localized in the lumen of the endoplas- 
mic reticulum (ER). It is a member of the 70-kDa heat shock 
protein (HSP70) protein family (46) and, in an ATP-depen- 
gent manner, assists in the translocation, folding, and assembly 
i>f oligomeric proteins (10, 17). In the capacity of a molecular 
Ifiaperone protein, GRP78 binds to a wide repertoire of 
jiroteins traversing through the ER. Another function of 
|GRP78 may involve its calcium-binding capabilities. It is 
|fostulated that GRP78, as well as other ER-localized GRPs, 
|an bind and regulate the mobilization of sequestered calcium 
the ER compartment, thus playing a key role in the 
ISlcium-dependent protein folding and transport processes 
136, 41). 

pThe grp78 gene system provides an interesting model for 
studying the transcriptional activation of ER protein genes (16, 
§7). As a gene encoding a ubiquitous ER protein performing 
prmal physiological functions, grp78 is transcribed at a basal 
Vel constitutively in many different tissues and cell types (32). 
|ts rate of transcription, however, is induced when the ER 
||rnpartrnent is subjected to stress. Inducers for grp78 include 
Jnicamycin, which blocks glycosylation, denaturing reagents 
t&jiich disrupt protein disulfide bond formation, oxidative 
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stress, the accumulation of malfolded or abnormal proteins in 
the ER (11, 15-17, 47), and continuous heat shock (43). 
Brefeldin A, which inhibits protein transport from the ER to 
the Golgi apparatus and causes Golgi resorption, activates 
grp78 transcription (23). In mammalian cells, one of the most 
potent inducers for grp78 transcription is the calcium iono- 
phore A23187 (21, 32) and thapsigargin (19, 30, 44), a non- 
phorbol ester tumor promoter that inhibits specifically the 
endoplasmic reticulum Ca 2+ -ATPase; both reagents enhance 
the transcription of grp78 by about 20-fold through the deple- 
tion of Ca 2+ stores from the ER (8). The efflux of Ca 2+ from 
the ER store has also been known to interfere with protein 
folding (24). 

The mammalian grp78 promoter contains multiple elements 
which are functionally redundant (45). While the distal ele- 
ments contribute mainly to the basal-level expression of the 
promoter, the proximal region contains the regulatory ele- 
ments for stress inducibility (5). Using an extensive series of 5' 
deletion, linker-scanning, and internal deletion mutants of the 
rat grp78 promoter, we established that a region spanning 
-170 and -134 is important for basal-level expression and 
induction by A23187, thapsigargin, malfolded protein, and 
brefeldin (19, 23, 47). This region has been referred to as the 
grp core element because its sequence is conserved among grp 
promoters from many species, including Saccharomyces cerevi- 
siae, Caenorhabditis elegans, chickens, rats, and humans (22, 28, 
34, 39). Especially between human and rat grp78 core ele- 
ments, the nucleotide sequence is 95% conserved. By using 
nuclear extracts from HeLa cells, this region is footprinted 
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(34), and synthetic oligomers of the core elements can compete 
for nuclear factors binding to grp promoters in vitro and in vivo 
(5, 22). Stable integration and amplification of the core 
element in CHO cells coordinately downregulate the endoge- 
nous grp gene expression^ (20). Further, we demonstrated that 
synthetic oligomers of the grp core element are capable of 
conferring partial stress inducibility to a heterologous pro- 
moter (19). 

In this study, we analyzed in vivo footprinting of the human 
grp78 promoter and determined whether in vivo factor binding 
correlates with transcriptional induction of grp78 by various 
forms of stress. We observed that there are specific changes in 
factor occupancy after stress induction and that the major 
changes occur within a cluster of bases located in the 3' half of 
the grp core. Nonetheless, by gel mobility shift assays, factors 
capable of binding to the regulatory elements are present in 
cells prior to stress induction, irrespective of in vivo protection. 

With the importance of the grp core established by genomic 
footprinting and transactivation of heterologous promoter in 
vivo, we seek to purify and characterize the nuclear proteins 
which interact with this site. Here we describe that in HeLa 
nuclear extracts, multiple factors bind to the core element. 
However, among the complexes formed with the grp core, a 
minor complex, termed II', exhibits specific binding to the 
3'-half region, which is required for stress induction of grp78 
(22) and is the site where factor occupancy changes are 
observed after stress induction in vivo. Complex II' binds core 
DNA with high affinity and exhibits rapid association and 
dissociation kinetics. Previously, we identified several proteins 
from HeLa nuclear extracts with molecular sizes of 220, 110, 
90/92, 70, and 55 kDa which were specifically cross-linked by 
UV light treatment to the grp core element (22). Here, by size 
fractionating the HeLa nuclear extract on sodium dodecyl 
sulfate (SDS)-polyacrylamide gels followed by renaturation, 
we observed that complex II' is able to be reconstituted from 
the renatured proteins in the molecular size range of 65 to 75 
kDa. Using a combination of chromatography steps, we puri- 
fied a binding factor (p70CORE) from other binding factors, 
including the abundant Ku autoantigen, which exhibits high 
affinity for the core element. p70CORE, the first human 
nuclear factor identified to interact with the grp78 core ele- 
ment, consists of protein species of 70 kDa in its monomeric 
form and binds specifically to the region required for A23187 
inducibility and other kinds of stress induction of grp78. 

MATERIALS AND METHODS 

Cell lines and culture conditions. HeLa cells were grown in 
Dulbecco's modified Eagle's medium with 5 to 10% bovine calf 
serum. Cells were heat shocked at 42°C or treated with the 
calcium ionophore A23187 at a final concentration of 7 u,M or 
with tunicamycin (0.5 u,g/ml) for indicated periods of time. The 
hamster K12 cell line has been previously described (19, 33). 

Transcriptional run-on analysis. Run-on transcription reac- 
tions were performed with isolated nuclei (approximately 7 X 
10 6 nuclei per reaction) in the presence of 100 \iCi of 
[a- 32 P]UTP (3,000 Ci/mmol; Amersham) as previously de- 
scribed (3). Radioactive RNA was hybridized to nitrocellulose 
filters on which the following plasmids had been immobilized: 
pGEM2 (vector; Promega), pH 2.3 (human hsp70 [48]), pU- 
CHS801 (human hsp90/89a [12]), pHG23.1.2 (human grp78 
[42]), pHA7.6 (human hsc70), and pGAPDH (rat glyceralde- 
hyde-3-phosphate dehydrogenase [9]). The hybridization and 
washing conditions were as described previously (38). r 



min with 0.2% dimethyl sulfate (DMS) at room temperature. 
Genomic DNA was isolated, digested with EcoRI, and cleaved 
with piperidine (4). The methylation pattern of naked DNA 
was obtained by using deproteinized DNA which was treated : 
with DMS in vitro. Genomic footprinting analysis was per- 
formed by using a ligation-mediated PCR method (29). For the 
ligation-mediated PCR of the coding strand, the first genomic 
primer was a 19-mer, 5' TGTCTGTGCTGTCTTGGCC 3', 
which was complementary to the +12 to +30 region of the 
coding strand. The second genomic primer was a 25-mer, 5' 
TTATATACCCTCCCCCAGCCCCGTC 3', complementary 
to the -19 to —43 region of the coding strand. The third 
genomic primer used for end labeling was a 29-mer which 
overlapped and extended 3' of the second primer and had the 
sequence 5' TTATATACCCTCCCCCAGCCCCGTCGTGG 
3'. The first genomic primer used for footprinting of the 
noncoding strand was a 19-mer, 5' GGTCAGAAGTCGCAG 
GAGA 3', complementary to the -283 to -265 region of the 
noncoding strand. The second genomic primer was a 25-mer, 
5' CTGAACCAATGGGACCAGCGGATGG 3', comple- 
mentary to the noncoding strand at nucleotides —252 to -228. 
The primer used for end labeling was a 28-mer which was 
complementary to the noncoding strand at nucleotides -252 
to -225 and thus had the same sequence as the second primer 
with three additional bases, GGC, at the 3' end. The sequence 
of the common linker was as described previously (29). 

Preparation of nuclear extracts. The HeLa cells were cul- 
tured to a density of about 5 X 10 5 cells per ml. Exponentially 
growing cells were harvested to avoid growth density-related 
stress. For the A23187-induced nuclear extract, the HeLa cells 
were treated with 2 |jlM A23187 for 5 h prior to harvest. 
Nuclear extracts were prepared in parallel for the treated and 
nontreated cells as described previously (37), with the follow- 
ing modifications. A protease inhibitor cocktail was added to 
all buffers at the following final concentrations: phenymethyl- 
sulfonyl fluoride, 1 mM; benzamidine, 0.5 mM; and pepstatin, 
leupeptin, and aprotinin, 1 u,g/ml. During cell lysis, the resus- 
pended cell pellet was Dounce homogenized 8 to 20 times with 
tight pestle A (Wheaton Dounce tissue grinders), and the 
degree of lysis was monitored with a phase-contrast micros 4 
scope. The nuclei were obtained by centrifugation with a ;> 
Sorvall SS-35 rotor at 13,200 rpm for 30 s. Following ammo-^ . 
nium sulfate precipitation, the nuclear extract was subjected to 
ultracentrifugation with a VTi70.1 rotor at 30,000 rpm for 20 ^ 
min. After the final dialysis step, the extracts were quick-frozen tJ;. 
in liquid nitrogen and stored in aliquots at -70°C. "j tiT- 

Gel shift assays. The synthetic oligonucleotides were puri- 
fled on 12% denaturing polyacryl amide gels, and equal^Jil 
amounts of the complementary strands were reannealed and Jf|£f. 
labeled with Klenow enzyme in the presence of [a- 32 P]dCTr\||pg 
(35). The probe was purified from free nucleotides by Microj j|f§p 
con 10 columns (Amicon, Beverly, Mass.). Each gel shift j|p| 
reaction was carried out in 20-^1 volume in the presence of lXj^fpgl 
binding buffer {10 mM Tris (pH 7.5), 2 mM MgCl 2 , 1 mM^g| 
EGTA [ethylene glycol-bis ((3-aminoethyl ether)-NJ*&' Jf[*^** 
tetraacetic acid] , 80 mM NaCl, 0.25 g of bovine serum aJbunun jg§ 
(BSA) per ml}, with the few exceptions indicated in the figure||||g 
legends. In regular binding studies, 0.1 u,g of either control or|p 
induced HeLa nuclear extract, 1 ng of probe (specific activity, 
10 7 to 10 8 cpm/|xg), and 10 ng of sonicated salmon sperm DNAj 
or 20 ng of sonicated poly(dl-dC) (average size, 200 bp) as^j 
nonspecific competitor were added. However, no nonspecific 
DNA was used when DNA affinity-purified column fractions^ 
were , assayed. In competition studies, nuclear ^extract ^aj 





fbefore the addition of probe. The complete reaction mixtures 
twere incubated on ice for 20 to 30 min except where indicated. 
^Electrophoresis was carried out in a Protein II xi apparatus 
|(Bio-Rad, Richmond, CaJif.)^eitherl50V^for 4.5 h or 300 V 
for 2.5 h. During electrophoresis, the temperature" was main-' 
Stained at 4 to 16°C through a recirculating ice water bath. The 
fgel was dried, and the autoradiogram was exposed for 10 to 18 
' h at -70°C except where indicated. 

" Renaturation of proteins recovered from SDS-polyacryl- 
amide gels. The procedure used was as described previously (2, 
26), with the following modifications. Samples containing 200 
%g of nuclear extracts were heated at 65°C for 3 min in SDS 
Sample buffer (10 mM Tris [pH 6.8], 2% SDS, 5% glycerol, 2% 
p-mercaptoethanol, 0.1% bromophenol blue) and size frac- 
tionated by SDS-polyacrylamide gel electrophoresis (PAGE). 
Electrophoresis was carried out until the bromophenol blue 
dye ran out, in Protein II xi (16- by 20-cm plates) cooled to 4°C 
^as described above. High-molecular-weight size markers 
?(GIBCO-BRL) were run along with the samples for the 
^calibration of the protein sizes in the sample lane, which was 
'cut into 16 1- by 1-cm slices, crushed in Eppendorf tubes with 
\ micro-tissue grinder, and soaked and rocked for 18 to 24 h in 
500 \l\ of elution buffer (50 mM Tris [pH 7.9], 1 mM 
dithiothreitol, 0.2 mM EDTA, 1 mM phenylmethylsulfonyl 
fluoride, 0.1 mg of BSA per ml, 2.5% glycerol, 0.1% SDS) (26) 
at 4°C. The eluate was collected through centrifugation 
through 0.45-^m Z-spin columns (Gelman Sciences) at 14,000 
g for 5 min. The filtrate (250 \x\) was acetone precipitated, 
washed with methanol, and air dried. Renaturation was carried 
out in renaturing buffer (20 mM Tris [pH 7.6], 10 mM KC1, 2 
mM dithiothreitol, 0.1 mM phenylmethylsulfonyl fluoride) at 
4°C with rocking for 18 to 24 h. To store the renatured 
fractions, 50% glycerol in renaturing buffer was added to each 
( tube to a final glycerol concentration of 25%. The samples 
'■were then quick-frozen in liquid nitrogen and stored at -70°C 
^To renature the affinity-purified nuclear extract,. 100 of the 
{affinity-purified fraction was used in a procedure identical to 
that described above. 

I Chromatographic fractionations of HeLa nuclear extracts, 
(i) BioRex 70 chromatography. All buffers in this and subse- 
quent chromatography steps contained the protease inhibitors 
.at concentrations specified above in the preparation of the 
extracts. All chromatographic steps were carried out at 4°C. 
,The BioRex 70 resin (mesh size, 100/200) was preequilibrated 
; in buffer D (20 mM N-2-hydroxyethylpiperazine-A r '-2-ethane- 
Wfonic acid [HEPES; pH 7.9], 20% glycerol, 0.2 mM EDTA, 
,0.2 mM EGTA, 2 mM dithiothreitol) as recommended by the 
manufacturer (Bio-Rad). Fifty milligrams of HeLa nuclear 
extract prepared from A23 187- treated cells was cleared by 
Centrifugation at 10,000 x g for 10 min at 4°C. The proteins 
[were then loaded onto a 12-ml column at 0.1 M KC1, incubated 
"for 15 min, and then eluted with a KC1 step gradient (0.2, 0.3, 
0.4, 0.6, 0.8, and 1.0 M) in buffer D at 15 ml/h. The 0.4 M 
faction's containing IF activity were pooled and dialyzed into 
"buffer D with 0.1 M KC1. The concentration of the dialyzed 
proteins was determined by the Bio-Rad assay, with BSA as a 
^standard. The protein profile was visualized by Coomassie blue 
f taining of SDS-polyacrylamide gels. 

p (ii) DNA-cellulose chromatography. Double-stranded calf 
thymus DNA cellulose was obtained from Sigma. The material 
W preequilibrated in buffer D with 0.1 M KC1. The 0.4 M 
BioRex dialysate (2.1 mg) was applied onto a 9-ml column at 
.a flow rate of 3 to 4 ml/h. The column was eluted with a step 
KC1 gradient (0.2, 0.3, 0.4, 0.6, and 1.0 M) in buffer D. The 0.3 



protein profiles' were^swalu^' by silver "staining. 

(iii) grp core affinirychromatography. The grp78 core syn- 
thetic oligonucleotides were gel purified as described above; 
the complementary strands were reannealed and ligated as 
described previously (13).~The oligomers (l-mg)-were coupled 
to 2 g of cyanogen bromide-activated Sepharose 4B (Pharma- 
cia, Piscataway, N.J.). The coupling of the oligomers to the 
resin and termination were performed as described previously 
(13 49) and the resin was stored in the column storage buffer 
(lOmM Tris [pH 7.6], 1 mM EDTA, 0.3 M NaCl, 0.02% NaN 3 ) 
at 4°C. The 0.3 M fractions containing the II' binding activity 
from the calf thymus DNA-cellulose column were pooled, 
dialyzed into buffer D with 0.1 M KC1, incubated with 2 ng of 
sonicated poly(dl-dC) per uJ of extract, and applied onto a 
1-ml (bed volume) affinity column. The column was eluted with 
a step gradient of KC1 (0.1, 0.2, 0,4, 0.6, and 1.0 M) in buffer D 
containing 3 mM n-octyl glucoside (40) at a flow rate of 9 ml/h. 
The IP binding activity was recovered in 0.4 M KC1 eluate. The 
0.4 M eluate was dialyzed against buffer D containing 0.1 M 
KC1 and purified further by an additional cycle of core affinity 
chromatography. The purified fractions were stored in aliquots 
at -80°C. The protein profiles of the purified fractions were 
determined by silver staining, and the concentration was 
estimated by comparison against molecular weight markers. 

RESULTS 

Identification of constitutively associated transcription fac- 
tors on the grp78 promoter. In vivo genomic footprint analysis 
was used to identify which DNA sequences in the grp78 
promoter were constitutively associated with transcription 
factors. To accomplish this, we compared the methylation 
pattern of DNA isolated from non-heat-shocked cells with the 
pattern of deproteinized DNA methylated in vitro, i.e., naked 
DNA (Fig. 1). Protection against DMS reactivity mapped to 
guanine residues at positions -226, -224, -223, -207, -166 
to -162, -160, -154, -152, -149, -125, and -89 on the top 
strand and at positions -182, -181, -170, -161, -156, -150, 
-126, -94, -93, -61, -60, -47, and -46 on the bottom 
strand. In addition, certain sites exhibited increased DMS 
reactivity; these included G-229, G-206, G-159, G-157, and 
A-87 on the top strand and G-53 and G-63 on the bottom 
strand. These alterations localize to the Spl, CCAAT, and 
CREB-like sites present in the human grp78 promoter and 
support earlier studies on the role of these sites for basal 
expression of grp78 (47). 

Transcriptional activation of grp78 gene is accompanied by 
changes in protein-DNA interaction. We next compared the 
grp78 promoter sequences in cells treated with A23187 for 3 h, 
treated with tunicamycin for 6 h, or exposed to a 42°C heat 
shock for 4 h. As shown in Fig. 2A, transcription of the grp78 
gene as measured by nuclear run-on analysis was induced. 
After exposure of the cells to A23187, tunicamycin, or heat 
shock, parallel cell samples were treated with DMS in vivo and 
the methylation pattern of genomic DNA was analyzed by 
genomic footprinting. The most prominent inducible alter- 
ations in DMS reactivity patterns in the grp78 promoter can be 
detected at G-114, G-113, G-109, and G-75 on the coding 
strand and at G-112, G-lll, and G-108 on the noncoding 
strand (Fig. 2B). Minor changes at G-104 and G-105 on the 
coding strand were also observed. The majority of changes in 
in vivo occupancy occur in a region of nucleotides -114 to 
-104, suggesting that the transcriptional induction of grp78 is 
accompanied by factor binding to the sequence or that there 
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FIG. 1. Constitutive protein-DNA interactions within the human grp 78 promoter. Methylation patterns of the guanine (G) residues w^^Sli j 
analyzed by in vivo genomic footprinting and are presented for the coding and noncoding strands. Genomic DNA was isolated from control (Q^^|^ 
no heat shock, A23187, or tunicamycin treatment) HeLa cells that were treated with DMS in vivo. Lane N contained naked (deproteinized) DNA '|||p}f 
that was treated with DMS in vitro. Long arrows indicate the G residues that are prominently protected from methylation, short arrows indicate 
partial protection, and asterisks indicate the G residues that are hypersensitive to methylation. The putative factor (C1F, C3F, CREB-like, and Spl) J|||f 
binding sites are bracketed. 



are changes in protein conformation occupying this site. The 
kinetics of inducible factor interactions correlated closely with 
the transcriptional activation and attenuation of the grp78 
gene: the DMS methylation pattern in DNA isolated from cells 
that have recovered from the A23187 or tunicamycin treat- 
ment resembles the pattern seen in control cells, as does the 
level of grp78 transcription (Fig. 2B, lanes R). In contrast to the 
transient nature of inducible factor interactions in the grp78 
promoter, the basal interactions located in the GC box, 
CCAAT box, and CREB-like elements were not affected by 
treatment with A23187 or tunicamycin. A summary of in vivo 
sites of factor interaction detected either in control or the 

induced state is shown in Fig. 3. . - . « 

' Binding of complex II' to the core region important for 
A23187 induction. The cluster of changes in factor occupancy 



conserved among grp promoters. A comparison of the numai Qlllp| 
and rat grp core regions is shown in Fig. 4 A. Within the 35 bp»|||||| 
there are only two base pair changes. The bases which showed 
changes in occupancy after stress are perfectly conserved^^^ 
between the two species. The rat core element has been further- 
dissected into two functional domains (22). The 5'-half regioS 
contains a CCAAT-like motif and is important for basal-level 
expression, and the 3'-half region is important for A23187 anjj 
thapsigargin inducibility (19, 22). The same region defined 
through mutagenesis as important for stress induction a" 
contained the sequence where changes in factor occupai 
were observed. Next, we used gel mobility shift assays 
determine whether we can detect in HeLa nuclear extraj 
binding activities which are specific for 3' - half of the 'con 
Using nuclear extracts prepared from A23187-induced 




B 

CODING STRAND 



NON-CODING STRAND 



N C A3 R T6 R HS 




N C A3 R T6 R HS 



SIR 

G-149 



G-130 
G-125 



G-I14 
-G-113 

-G-109 



8: 



G-89 
A-87 




^?wf^ if™ 

■ 3f& 



-G-108 
• 112 



C-126 



1 FIG 2 (A) Transcription activation of the grp78 gene during A23187 and tunicamycin treatments or heat shock of HeLa cells. Nuclear run-on 
analyses of hsp70, hs P 90a, grp78, hsc70, and glyceraldehyde-3-phosphate dehydrogenase (GAPDH) transcription were performed at the indicated 
time after addition of A23187 and tunicamycin or exposure to 42°C (heat shock [HS]). Vector denotes plasmid pGEM2. Lanes R represent cells 
"that after being treated with A23187 and tunicamycin were washed and allowed to recover for 6 h before the nuclei were isolated. (B) In vivo 
genomic footprint of the core region of the human grp78 promoter. After DMS treatment in vivo, genomic DNA was isolated from the parallel 
samples that were used for the nuclear run-on analysis, and methylation patterns of G residues were compared between control cells (C) and cells 
that were exposed to A23187 (A), tunicamycin (T), or heat shock (HS) treatment. Arrows indicate the G residues that are protected from 
methylation, and asterisks indicate the G residues that are hypersensitive to methylation. 



demonstrated that a Ku autoantigen found in high abundance 
hi HeLa nuclear extracts has a high affinity for the core binding 
site and is a major component of complex II (22). With 
modifications in the gel mobility shift assay and an extended 
electrophoresis running time, we discovered a faster-migrating 
complex, II\ which can be resolved from the major complex II. 
I To dissect the binding domains of the four complexes, 
competition assays were performed with the synthetic oli- 
gomers of the rat grp core mutated at either the 5' or 3' half 
(Fig. 4B). To control for nonspecific binding, a random 
synthetic oligomer bearing the mutated sequences of both the 
5' and 3' halves was used. With the exception of the mutated 
Bases, the sizes and the linker tails of all of the synthetic 
Jigomers are identical. As shown in Fig. 4C, upon competition 
;ith 100- and 200-fold molar excesses of the synthetic oli- 
gomers described above, complex II' corresponds to the com- 



plex which exhibits high affinity for the grp core 3'-half region, 
as complex II' was unaffected by the synthetic oligomers with 
the 3'-half mutation or the random sequence competitors but 
was efficiently eliminated by the wild-type core or the 5 '-half 
mutation competitors. In contrast, complex I and the majority 
of complex II appeared to be nonspecific, and complex III 
could be competed for equally by both the 5' and 3' halves of 
the core. 

In vitro core binding activities in control and induced HeLa 
nuclear extracts. To test whether the in vivo protection pattern 
changes correlate with changes of in vitro binding for the grp 
core, nuclear extracts were prepared from spinner cultures of 
exponentially growing HeLa cells maintained in normal culture 
medium or treated with the ionophore A23187 for 5 h. RNA 
blot analysis of these cells routinely showed a 10-fold increase 
in grp78 mRNA levels in the A23187-treated cells (Fig. 5A), 
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FIG. 3. Schematic presentation of the human grp78 promoter containing the consensus sequences and the respective transcription factors 
bound to these sites. Closed arrows and stars denote constitutive protections and hypersensitive sites of the DMS methylation of G residues 
respectively, as described in the legend to Fig. 1, and open arrows and stars denote inducible changes in the methylation pattern. 



demonstrating that the control cells were not subjected to ER 
stress during culture or harvest. The rRNA profile was used as 
a control for even loading of mRNA. With these nuclear 
extracts and the rat grp core sequence as the probe in gel 
mobility shift assays, the binding activities of the four com- 
plexes (I, II, IF, and III) from the control and A23187 extracts 
were indistinguishable (Fig. 5B). 

To characterize further the kinetics of formation of these 
complexes from control and induced extracts, the rates of 



association and dissociation of the complexes were measured. 
We observed that the formation of all four complexes was 
rapid such that steady-state levels of the binding activities were 
reached within 5 min of incubation of the nuclear extracts with 
the core probe (data not shown). In dissociation rate measure- 
ments, the extracts were preincubated with the core probe to 
allow the formation of the complexes prior to the addition of 
a 50-fold molar excess of the unlabeled core oligomers for 
various times (5, 15, 30, and 40 min) after the preincubation 
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FIG. 4. Binding specificity of complex II' to 3' half of the grp core. (A) Conservation of the core sequences between human and rat | 
relative position of the core sequence within each promoter is indicated. The divergent base pairs between the two core sequences^ejfi 
The bases which were protected in vivo in control cells are denoted by solid arrows; those bases "which exhibited change in site* ; BccupMC^ 
stress iriducticm~are marked by open arrows (protection) and open stars (hypersensitivity). (B) Sequences of ^ 

competitors. The wild-type sequence of the rat grp 78 core is shown in capital letters. Bold italic letters ; represent the" mutated seqUen^^ 
-lowercase letters indicate the linker seauences. (C) Combetitibn studies usine the "core 6^^ 
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Wiod. The results of these experiments (Fig. 5C) revealed 
that complexes I and II were highly stable. Once they were 
formed, there was no exchange with the unlabeled competitors 
during the 40-min incubation period. In contrast, complexes 11 
and III exhibited a rapid dissociation rate. Within 5 min ot 
incubation with the unlabeled competitor, complex II' was no 
longer detected. Incubation of unlabeled competitors at a 
100-fold molar excess for as short as 1 min also effectively 
eliminated complex IF (data not shown). For both control and 
induced extracts, the dissociation rates for complexes II and III 
were extremely rapid, whereas those for complexes I and II 
'We slow. Since complex II' binds with the highest specificity 
*to the 3' half of the core, which is functionally most significant 
^because of its contribution to stress induction, we focused on 
'identifying the protein component of complex II' which binds 
ito this DNA region. 

\ Complex II' can be reconstituted from renatured protein 
species of 65 to 75 kDa. Using UV cross-linking techniques, we 
identified the sizes of several polypeptides (210, 110 90/92, and 
! 70 kDa) from HeLa nuclear extracts which could be photola- 
Wled within the major complex II (including that of II'), .with 
'a 70-kDa protein species being the most predominantly labeled 
Lband (22). As the first step to ascertain whether any of the 
protein species identified above has the ability to bind directly 
to the grp core, HeLa nuclear extracts prepared fromA^lS?- 
^treated cells were size fractionated in preparative SDS-poly- 
crylamide gels. The gels were cut into 1-cm slices, and the 
iroteins, after electrocution, were allowed to renature. The 
denatured proteins were then used in gel mobility shift assays 
using the rat grp core as the probe. Examples of the gel 
mobility shift assays performed with renatured proteins recov- 
ered from the molecular size ranges of 115 to 140, 90 to 115, 
and 65 to 75 kDa are shown in Fig. 6. Using the regular binding 
uffer which contains Mg 2+ as the divalent ion, the only 
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FIG 6 Renaturation of complex II' binding activity. Representa- 
tive renatured fractions from regions of the denaturing SDS-polyacryl- 
amide gel were assayed for core binding activities. Each react.on 
mixture contained the rat grp78 core probe and either 0.1 N> of 
induced nuclear extract (NE; lane 1), 2 jtl of renatured fractions (lanes 
2 to 4) or probe only (lane 5) and was incubated on ice for 2h. The 
positions of the complexes and free probes (F) are indicated. The size 
range of proteins from each renatured fraction is indicated in kilodal- 
tons at the top. 
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FIG 7 Specificity of the renatured complex II' activity from p(65-75). (A) Gel shift analysis shows that p(65-75) is devoid of *u ^vity.Inj. 
lanes 1 to 7, each reaction mixture contained the rat OT 7S core probe, 0.1 »g of induced nuclear extract, and the following: ™ compears ane^ 
1) 100-fold molar excess of unlabeled core oligomers (lane 2), 100-fold molar excess of random competitors (lane 3), 0.1 to 0.05 uJ of ™}***^ 
amiserum (aku- lines 4 and 5), or 0.1 to 0.05 u/of an unrelated anti-E2F antiserum (ot-E2F; lanes 6 and 7). In lanes 8 to 12, each reaction mixture^, 
S to?T*w78 core probe, 2 p.1 of the renatured p(65-75) fraction, and the following: no antiserum (lane 8) or 0.1 to 0.05 |xl 

andW) ofL-E2F (lanes 11 and 12) antiserum. The antibodies were preincubated with the nuclear extract for 3 to 5 mm ^foreadd p|g 
of the probe All reaction mixtures were incubated further on ice for 20 min. (B) Specificity of the renatured p(65-75) binding activity. Compctit^g., 
studies were performed with a fivefold molar excess of competitors depicted in Fig. 4B. The positions of the free probe (F) and complex 
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complex which could be reconstituted was complex II', and 
that was achieved with renatured proteins in the size range of 
65 to 75 kDa. In an attempt to stabilize any DNA-binding 
factor which may require Zn 2+ for renaturation and binding, 
we tested whether addition of Zn 2+ to the binding buffer could 
enhance binding to the core from any of the fractions. We 
observed that the binding activities were only slightly enhanced 
by the presence of Zn 2+ (data not shown). These results with 
the unfractionated HeLa nuclear extract indicate that the 
protein species between 65 and 75 kDa, referred to below as 
p(65-75), has the ability to bind to the grp core after renatur- 
ation. While complex II' is most readily reconstituted from 
renatured protein fractions, it is possible that there are other 
species which also bind to the core probe but were hot able to 



tigen binding activity and a core-specific binding activity (22).WM 
Ku binds to double- and single-stranded DNA ends with hig^ 
affinity and random sequence specificity (1). The Ku autoafc^ 
tigen consists of a 70-kDa DNA-binding component an|J|» 
80-kDa component. Since the protein species that we idelnti^ 
fled which can reconstitute complex II' are similar in moletuOa^ 
size to the DNA-binding component of Ku, it is critical 
establish that p(65-75) is not Ku or contaminated with it^ 
this purpose, we used two approaches to test the identity J 
specificity of p(65-75). The first approach was to test wgetl 
p(65-75) can react with the antibody against Ku and, in^ 
so, affect complex II' formation with the grpcore, ;As sJjjj 
Fig. 7 A, the abundant complex II formed with the uffi|< 
ated HeLa extract was largely supershtfted 





h™ni^, n !f t08r h aphiC ^ UrifiCati0n °/ COm ? leX binding 3C,ivit y and identification of p70CORE as the core-binding polypeptide (A) 
lumn punfication schemejhe activity of complex II' was monitored by gel shift assays. The salt concentrations at which II' activity eluted from 

^aS^l^S^^^?^ T Pr ° ffi fi,e rf ^W-iW II' binding activity. Lane 1, unfK&XlS 

ract (rvifc,), lane 2, buffer alone; lane 3, 0.4 M eluate from affinity-purified fraction (Aff.); lane 4, molecular size markers (m The hrartPtPH 

^^^T^J^^T^^ of the affinity-purified fraction. Each reaction mixture contained the "JgrplS core probe and Xer 

KS^SiT ^ ^ (Aff °' ° r 20 ^ ° f renatUred fraCti0ns ^ throu 8 h 16 ' the numbers represent the ge 

ces in the direction of decreasing molecular weight). The reactions were incubated on ice for 2 h before electrophoresis at 4°C After 
ectrophoresis, the dried gel was exposed for 7 days at -70°C The dark spot in the middle of lane 4 is an artifact f^o^^p^iSi 



[ preincubation with either the anti-Ku antigen or the anti- 
F antibody. These results demonstrate that p(65-75) is not 
u and is not contaminated with it. 

:s Next we determined whether p(65-75), after isolation from a 
featuring gel and subsequent renaturation, retained its spec- 
city for the 3' half of the grp core. For this purpose, 
mpetition assays with various synthetic oligomers (Fig. 4B) 
pre performed with the reconstituted complex II' (Fig. 7B). 
e observed, as in the case of the complex IF formed from the 
nfractionated extract (Fig. 4C and 7A, lanes 1 to 3), that the 
"onstituted complex IF is highly specific for the 3' half of the 
e sequence. Only the wild-type core oligomer and the 5' 
mated oligomer, containing the wild-type 3 '-half core se- 
ance, were able to abolish complex IF. The addition of the 
^mutated core oligomer and two other heterologous compet- 
~rs had no effect. The specificity of the renatured complex 
*-s retained when up to 200-fold molar excesses of competi- 
. i were used (data not shown). 

Purification of complex IF binding activity from A23187- 
rf ""ed HeLa extracts. With the establishment of a gel 
ity shift assay which allows us to monitor the complex IF 
lg activity, . the induced HeLa nuclear extract was sub- 
. ! to a series of chromatographic steps for the purpose of 
. fying p(65-75), which binds to the 3' half of the grp core 
ni gh specificity and affinity, using the purification scheme 
*n in Fig. 8A. 

Jxrnt 50 mg of HeLa nuclear extract was first applied onto 
ioRex 70 column, which is a high-capacity, weak cationic 
: pange column. Upon step elution with increasing concen- 
>tions of KC1, the complex IF binding activity, as detected by 
mobility shift assays, was enriched in the 0.4 M fractions. 



The active fractions were then applied onto a double-stranded 
calf thymus DNA-cellulose column to resolve the DNA- 
binding proteins on the basis of their relative affinities for 
general DNA. This column was useful in separating the 
complex II binding activity from the complex IF binding 
activity, which eluted primarily in the 0.3 M fraction. Complex 
II (mostly Ku) was enriched in later elution fractions (data not 
shown). Finally, the complex IF active fractions were applied 
onto a Sepharose 4B column coupled with the concatenated 
core oligomers. The bound proteins were eluted with a step- 
wise KCI gradient, and complex IF was recovered in the 0.4 M 
fraction. As shown in Fig. 8C, the affinity-purified fraction in 
gel mobility shift assays formed only complex IF with the grp 
core probe. Thus, the multiple chromatographic fractionation 
steps were successful in segregating the components compris- 
ing complexes II and IF. A comparison of the silver stain 
patterns of the unfractionated extract and the affinity-purified 
fraction enriched for complex IF is shown in Fig. 8B. For the 
affinity-purified fraction, there were several groups of protein 
bands, including those with sizes in the ranges of 120, 97, and 
70 kDa, which were enriched. A lane consisting of the buffer 
alone was included to account for the cluster of nonspecific 
bands below the 68-kDa size marker. Based on their relative 
specific DNA binding activities, the enrichment for complex IF 
binding activity at this final stage of chromatographic purifica- 
tion is about 10,000-fold (Table 1). 

Identification of p70CORE as the DNA-binding component 
of complex IF. Using unfractionated HeLa nuclear extract, we 
demonstrated above that the protein component of complex 
IF has a molecular size of about 70 kDa. Upon chromato- 
graphic purification, a set of proteins of about 70 kDa (referred 
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TABLE 1. Purification of p70CORE from HeLa nuclear extract 



B 



Fraction 


Protein 
(mg) 


Total 
activity 

(ur 


Sp act 
(U/ng) 


Purification 
(fold) 


Recovery 
<%) 


Nuclear extract 


50 


500,000 


10 






BioRex 70 


2.1 


400,000 


190 


19 


80 


Calf thymus DNA 


0.34 


288,000 


847 


85 


58 


2x core affinity 


0.002 


186,000 


93,000 


9,300 


37 



a One unit is denned as the amount of binding activity of 0.1 jxg of nuclear 
extract under normal binding conditions. 



to as p70CORE; bracketed bands in Fig. 8B) was also detected 
by silver staining, among other protein bands which were 
copurified. To determine whether p70CORE is the purified 
form of p(65-70) and whether other core-binding proteins were 
also purified, the affinity-purified fraction containing the com- 
plex II' binding activity was applied onto a preparative SDS- 
polyacrylamide gel to fractionate the proteins. Gel slices of 1 
cm each were cut along the whole length of the gel. After 
elution, the proteins were allowed to renature and used in gel 
mobility shift assays, using the rat grp core as the probe. The 
results are shown in Fig. 8C. Of all of the gel slices tested, only 
slice 9, containing p70CORE, was able to form a complex. The 
complex is II', as judged by its electrophoretic mobility. 

To confirm that the affinity-purified fraction and p70CORE 
retained the ability to bind to the 3' half of the grp core, 
competition assays with wild-type and mutated core oligomers 
were performed as described for Fig. 4C. As shown in Fig. 9 A, 
a side-by-side comparison of the electrophoretic mobility of 
complex IF formed by the affinity-purified fraction and that of 
the unfractionated nuclear extract confirmed that they were 
identical. Complex II' as formed by the affinity-purified frac- 
tion was specifically inhibited by the grp core oligomer but not 
by the random oligomer competitor. Whereas p70CORE after 
its isolation and renaturation exhibits a substantial loss of 
protein mass and binding activity (only — 1% recovery of 
binding activity), a side-by-side comparison of complex II' 
derived from the affinity-purified fraction and p70CORE 
showed that they migrated with identical electrophoretic mo- 
bilities (Fig. 9B). Further, the specificity of p70CORE for the 
3' half of the grp core was retained, as demonstrated by 
competitions with the wild-type and 5' mutated oligomers but 
not the 3' mutated or random oligomers at a 100-fold molar 
excess (Fig. 9B). 

To determine whether the affinity-purified fraction binds to 
the same bases which exhibit in vivo stress-inducible changes in 
methylation pattern, competition analyses with additional syn- 
thetic oligonucleotides were performed. The mutations, Ml 
through M4, each contained 4-bp substitutions within the 3' 
half region of the grp core (Fig. 10A). The complex II' formed 
by the affinity-purified fraction can be competed for only by the 
wild-type core and M4 oligomers, which mutated 4 bp outside 
the in vivo DMS footprint region.(Fig. 10B). The oligomers Ml 
and M2, which created base mutations within the DMS- 
protected region, were unable to compete efficiently even at a 
100-fold molar excess. The inability of M3 to compete suggests 
that bases undetectable by DMS protection in vivo in the 
immediate flanking region may also be involved in protein- 
DNA interaction. In comparison, the competitor with the least 
potency is 3 'mt, which mutated the entire 3' half of the grp 
core. These, combined results demonstrate that the affinity- 
purified, fraction binds to the same ; sites which exhibit stress-, 
induciblej changes • that jpTOCORE ■ as: 
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FIG. 9. Specificity of the p70CORE binding activity. (A) Specificity 
of the affinity-purified fraction as determined from competition stud- 
ies. Lane 1 contains the rat grp78 core probe mixed with induced 
nuclear extract (NE) and a 100-fold molar excess of random compet- 
itors. Lanes 2 to 4 contain the rat grp78 probe and 2 [i\ each of the 
affinity-purified II' binding activity (Aff.) and either no competitors 
(lane 2) or a 100-fold molar excess of unlabeled core (lane 3) or 
random (lane 4) competitors. (B) Specificity of the renatured 
p70CORE binding activity. Lane 1 is a longer exposure of lane 4 in 
panel A. Lanes 2 to 6 contain the rat grp78 core probe, 20 u,l each of 
renatured p70CORE and a 100-fold molar excess of the various 
competitors as indicated at the top. After electrophoresis, the dried gel i£ 
was exposed for 14 days at -70°C. The smearing effect observed in ' 
lane 2 of both panels A and B is due to the 12.5% glycerol in the final " 
40-jjl1 reaction volume. F, free probe. : { . '■ 



component of complex II' which binds specifically to the region, 
of the grp core required for stress induction. 



DISCUSSION 



The regulation of GRP78 in response to stress conditions .in^^ 
the ER represents a unique model for signal transduction. 'j^^^M 
the case of the mammalian grp genes, a depletion of ithe^^l 
calcium store from the ER or the accumulation of malfold^d^^g 
or underglycosylated proteins apparently generates a signal^ 
that is transmitted to the nucleus and leads to ermancetl* 
transcription of the grp genes. This transcriptional regulation^ 
mediated primarily by upstream promoter . sequences (19^32)[ 
While the sequence organization of the rat grp 78 promoteri^ 
complex and contains functionally ^ ; redundant;.elemente; s (f *" v 
the ^sequence jspan^ 
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Ml c t cgaGCCGCTTCGAATCGGC AG* t taCAGCTTGGTGGC AT 

CGGCGAAGCTTAGCCGTCtMtGTCGAACCACCGTAcagctg 

M2 c t eg aGCCGC TTCG AATCGGC AGC GGC * c t *TTGGTGGC AT 

CGGCGAAGCTTAGCCGTCGCCG tga tAACCACCGTAcagc tg 

M3 c t eg a GCCGC TTCG AATCGGC AGCGGC C AGC ggt tTGGCAT 

CGGCGAAGCTTAGCCGTCGCCGGTCGec«*ACCGTAcagctg 

M4 ct cgaGCCGCTTCGAATCGGCAGCGGCCAGCTTGGgt t«AT 

CGGCGAAGCTTAGCCGTCGCCGGTCGAACCcaafcTAcagctg 

3' mt ctcgaGCCGCTTCGAATCGGactatcgactagcttgctacg 

CGGCGAAGCTTAGCCtff«tagctg«tcg**Cff«taccagctg 




FIG. 10. Delimitations of the complex II' binding site with the affinity-purified fraction. (A) Sequences of the oligomers used as probes and 
competitors. The wild-type sequence of the rat grp78 core is shown in capital letters, with the bases which exhibited changes in site occupancy after 
tress induction marked by arrows (protection) and stars (hypersensitivity). Bold italic letters indicate the mutated sequences, and lowercase letters 
denote the linker sequence. (B) Competition of complex II'. Each lane contained 2 jjlI of the affinity-purified fraction (Aff.) with either no 
competitor (first lane) or increasing molar excesses (25-, 50-, and 100-fold) of the competitors indicated at the top. The autoradiograms are shown. 
The positions of complex IT and the free probe (F) are indicated. 



ranging from yeasts to humans (5, 32). This high degree of 
conservation is consistent with its functional significance in ER 
stress induction, as revealed by promoter mapping studies (22) 
and its ability to confer ER stress inducibility to heterologous 
romoters in both the mammalian and yeast systems (19, 28). 
Moreover, binding element titration in vivo reveals that the 
pore element is essential for stress induction of the grp genes 
and for cell survival (5, 20), and the subdomain responsible for 
Icium stress inducibility was mapped to the 3' half of the core 
egion (22). 

To understand how the core element mediates stress indue- 
on of grp78 transcription, we have previously performed gel 
"obility shift assays with a 291 -bp grp promoter fragment (7) to 
etermine if differences in factor binding could be detected in 
L L duced cell extracts compared with control cell extracts. In 

ose in vitro assays, the binding activities were similar, 
irrther, in yeast cells, the homologous grp core binding activity 
as also reported to be constitutive, and no differences be- 

een control and stressed cells were found (28). Using in vivo 



footprinting techniques to investigate protein-DNA interac- 
tions of the human grp78 promoter, we report here that specific 
changes in the methylation pattern such as protection or 
hypersensitivity of specific nucleotides were detected on the 
coding and noncoding strands within the 3' half of the grp core 
when the cells were stressed. Further, the changes in factor 
occupancy directly correlate with transcriptional induction of 
the endogenous grp78 gene. Nonetheless, the lack of in vivo 
protection in nonstressed cells was apparently not due to the 
absence of factors that are capable of binding to the highly 
conserved grp core, since extracts from induced and control 
cells bind with similar specificities and kinetics to the rat grp 
core, as analyzed in vitro by gel mobility shift assay. The lack of 
in vivo footprinting despite the presence of binding factors has 
been reported for the promoters of the tyrosine aminotrans- 
ferase (4), major histocompatibility complex class II (14), and 
(^-microglobulin genes (25). Thus, the grp78 gene provides an 
additional example wherein the presence of element-specific 
factors in control cells is not sufficient for producing in vivo 
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occupancy, whereas after stress treatment, the factors are able 
to access the DNA. One explanation is that a higher order of 
chromatin configuration, or some other inducible interacting 
factor(s), exerts control over the accessibility of the binding 
factor to the specific DNA elements upon stress induction. 
Since similar binding activities were detected in vitro from 
nuclear extracts of control and stress-induced cells, it is 
unlikely that the accessibility is controlled by nuclear transport 
of a cytoplasmic factor. The combined results of the in vitro 
and in vivo analyses underscore the importance of examining 
the DNA-protein interactions in vivo in the context of the 
chromatin, where subtle differences in factor binding would 
otherwise be undetectable in vitro. 

Further, we noted that despite the lack of a heat shock 
element, identical protections and hypersensitivities of guanine 
residues were seen in the human grp78 promoter from heat- 
shocked cells and those treated with A23187 or tunicamycin. 
The grp78 gene (KAR2) in yeast cells is heat shock inducible; 
however, this response is mediated by a heat shock element 
within the yeast KAR2 promoter (28). One explanation is that 
the kinetics of heat induction of grp 78 is distinct from the 
kinetics of other classical heat shock genes, e.g., hsp70 and 
hsp90, such that it requires continuous heat shock (4 h). Such 
prolonged heat treatment could result in a physiologically 
stressful condition similar to those elicited by A23187 and 
tunicamycin. These results indicate that several different stim- 
uli mediate their effects through common inducible protein- 
DNA interactions occurring at the 3' half of the grp core 
sequence. 

Given the importance of the grp core, we sought to identify 
the binding activity specific for 3' half of the core region. By 
using half-site mutants, we were able to define complex II' as 
the activity specific for 3' half of the core. Through kinetic 
studies, we determined that complex II' binding to the 3' half 
of the core is extremely rapid. At the same time, it dissociates 
from the core rapidly. This quick exchange rate does not 
compromise the stability of the complex II' over time since 5' 
mutant and random competitors do not compete even at a high 
molar excess. This unique property of complex II' may be 
favorable for the regulatory role that it portends. It may be one 
mechanism through which a gene can be quickly attenuated 
through a change in the accessibility of DNA. 

From fractionation of induced nuclear extract, we found that 
the complex II' activity may be reconstituted from renatured 
fractions of polypeptides in the range of 65 to 75 kDa. Since 
the Ku autoantigen has been previously shown to bind the grp 
core (22), it was necessary to determine whether the renatured 
activity was specific to the grp78 core and whether the rena- 
tured activity was from the abundant Ku autoantigen. The Ku 
autoantigen consists of a 70-kDa DNA-binding component 
and a 80-kDa component. While its binding specificity appears 
to be nonspecific, Ku was recently shown to be a substrate for 
a DNA-activated protein kinase (18). Interestingly, the tran- 
scription of grp78 was suppressed by protein kinase inhibitors 
(30, 33). Whether the binding of Ku to the grp core exists in 
vivo and whether it plays any specific role in activating the core 
complexes remain to be determined. Nonetheless, we demon- 
strate here that complex II' is distinct from the Ku autoanti- 
gen. Complex IT from renatured fractions exhibits the same 
electrophoretic mobility as that from the unfractionated nu- 
clear extract, suggesting that it is unlikely to consist of heter- 
ologous subunits outside the size range of 65 to 75 kDa. :p 

Since complex II' is highly specific for the 3' half of the core, 
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With the additional steps of core-specific DNA affinity chro-*J 
matography, we were able to recover an affinity-purified frac- % 
tion of about 10,000-fold purification. From silver stains, these £ 
fractions contained several major polypeptides. To determine 
which polypeptides bind specifically to the grp78 core, we 
renatured polypeptides of specific size ranges after SDS- 
PAGE. In agreement with results of a similar study using the 
crude nuclear extract, we observed that proteins in the size 
range of 65 to 75 kDa give renatured binding activity. In this 
range, the enriched protein bands were a cluster of several 
polypeptides with molecular sizes of about 70 kDa (p70CORE). 
By itself, p70CORE reconstitutes the complex IF binding 
activity and exhibits binding specificity. Through in vitro com- 
petition with the affinity-purified fraction, complex II' interacts 
with the same bases for which inducible changes are observed 
in vivo. The identification of the p70CORE which binds to the 
region of the core responsible for inducible expression is an 
important first step toward characterization of the protein 
involved in and understanding the mechanism of the induction 
of grp78. 

Recently, a transmembrane serine/threonine kinase, IREp, 
has been shown in yeast cells to be essential in the induction by 
malfolded protein accumulation in the ER (6, 27). It has been 
postulated that IREp, which spans the ER membrane, trans- 
mits a signal from the ER to the nucleus. However, the ligands 
of IREp, as well as its substrates, are unknown. For the 
mammalian grp system, there are reports that its induction is 
sensitive to W7, a serine/threonine kinase inhibitor (33), and 
enhanced by okadaic acid, which is a serine/threonine phos- 
phatase inhibitor (31). However, genistein, a protein tyrosine 
kinase inhibitor, is also reported to inhibit grp78 induction 
(31). Thus, the pathways which regulate mammalian grp genes 
may be more complex than those in yeast cells. Assuming that 
a mammalian homolog of IREp does exist, it will be important 
to determine the relationship between p70CORE and this 
kinase and identify the intermediary signal transduction steps 
between the ER and the nucleus. 
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